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Non-Causally at the Relay 

Abdellatif Zaidi Shiva Prasad Kotagiri J. Nicholas Laneman Luc Vandendorpe 

Abstract 

We consider a three-terminal state-dependent relay channel with the channel state noncausally 
available at only the relay. Such a model may be useful for designing cooperative wireless networks 
with some terminals equipped with cognition capabilities, i.e., the relay in our setup. In the discrete 
memoryless (DM) case, we establish lower and upper bounds on channel capacity. The lower bound 
is obtained by a coding scheme at the relay that uses a combination of codeword splitting, Gel'fand- 
Pinsker binning, and decode-and-forward relaying. The upper bound improves upon that obtained by 
assuming that the channel state is available at the source, the relay, and the destination. For the Gaussian 
case, we also derive lower and upper bounds on the capacity. The lower bound is obtained by a coding 
scheme at the relay that uses a combination of codeword splitting, generalized dirty paper coding, and 
decode-and-forward relaying; the upper bound is also better than that obtained by assuming that the 
channel state is available at the source, the relay, and the destination. In the case of degraded Gaussian 
channels, the lower bound meets with the upper bound for some special cases, and, so, the capacity is 
obtained for these cases. Furthermore, in the Gaussian case, we also extend the results to the case in 
which the relay operates in a half-duplex mode. 
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I. Introduction 

We consider a three-terminal state-dependent relay channel (RC) in which, as shown in 
Figure [Tj the source wants to communicate a message W to the destination through the state- 
dependent RC in n uses of the channel, with the help of the relay. The channel outputs Y 2 and Y 3 
for the relay and the destination, respectively, are controlled by the channel input X\, the relay 
input X 2 and the channel state S, through a given memoryless probability law Wy 2 ,y 3 \x 1 ,x 2 ,s- The 
channel state S is generated according to a given memoryless probability law Qs- It is assumed 
that the channel state is known, noncausally, to only the relay. The destination estimates the 
message sent by the source from the received channel output. In this paper we study the capacity 
of this communication system. We refer to the model under investigation as state-dependent RC 
with informed relay. 
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Fig. 1. Relay channel with state information S n available noncausally at only the relay. 



A. Background 

Channels with random parameters or states have received considerable attention due to a 
wide range of possible applications. Shannon initiated the study of single-user models with 
state available causally at the encoder [1]. For the single-user discrete memoryless (DM) state- 
dependent models, Gel'fand and Pinsker derive the capacity for the setup in which the channel 
state is available noncausally at the encoder [2]. In this case, a random coding scheme based 
on binning, known as Gel'fand-Pinsker coding, achieves the capacity [2]. Costa considers an 
additive Gaussian channel with additive Gaussian state known at the encoder and shows that 
Gel'fand-Pinsker coding with a specific auxiliary random variable, widely known as dirty paper 
coding (DPC), achieves the trivial upper bound obtained by assuming the channel state available 
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also at the decoder [3]. Interestingly, DPC eliminates the effect of the additive channel state 
on the capacity, as if there were no channel state present in the model or the channel state 
were known to the decoder as well. It is worthnoting that since DPC achieves the trivial upper 
bound for this model there is no need to derive tighter upper bounds in this case. In [4], models 
with channel state available noncausally at the encoder are studied from the perspective of 
memories with defects. Practical coding realizations using concepts of lattices for the models 
with noncausal encoder state information are studied, e.g., in [5], [6]. For a review on the subject 
of state-dependent channels and related work, the reader may refer to [7]. 

A growing body of work studies multi-user state-dependent models with noncausal encoder 
state information [8]— [21]. In the multi-user models, the channel state can be known to all, only 
some, or none of the users in the communication system. In the case of state-dependent DM 
models, the multiple access channel (MAC) with partial channel state at all the encoders and 
full channel state at the decoder is considered in [11], and the broadcast channel (BC) with state 
available at the encoder but not at the decoders is considered in [12], [22]. 

In the Gaussian case, the MAC with all encoders being informed, the BC with informed 
encoder, the physically degraded relay channel (RC) with informed source and informed relay, 
and the physically degraded relay broadcast channel (RBC) with informed source and informed 
relay are studied in [8], [9], [18]. In all these cases, it is shown that some variants of DPC achieve 
the respective capacity or capacity region. Also, since for all these models DPC achieves the 
trivial upper or outer bound obtained by assuming that the channel state is also available at the 
decoders, it is not necessary to derive non-trivial upper or outer bounds, i.e., bounds that are 
tighter than the cut-set bound. For all these models, the key assumption that makes the problem 
relatively easy is the availability of the channel state at all the encoders in the communication 
model, which allows these encoders to remove the effect of the channel state on their respective 
communication using variants of DPC. It is interesting to study state-dependent multi-user models 
in which some, but not all, encoders are informed of the channel state, because the uninformed 
encoders cannot apply DPC. 

The state-dependent MAC with some, but not all, encoders informed of the channel state is 
considered in [10], [13]— [16], [21] and the state-dependent relay channel with informed source is 
considered in [18], [19]. For the Gaussian cases of these models, the informed encoder applies a 
slightly generalized DPC (GDPC) in which the channel input and the channel state are negatively 
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correlated. It is interesting to note that in these models the uninformed encoders benefit from 
the GDPC applied by the informed encoders because the negative correlation can be viewed as 
partial state cancellation. The capacity region of the discrete memoryless state-dependent MAC 
with one informed encoder is characterized in [14], [16] for the case in which the messages sets 
are degraded and the informed encoder knows the message of the uninformed encoder. In [16], 
the authors also study the Gaussian case and they characterize the capacity region by deriving 
a non-trivial outer bound that is strictly tighter than the cut-set outer bound. 

For the study of communication models in which only some of the involved encoders are 
informed about the channel state, it is important to establish non-trivial upper or outer bounds. 
These bounds help characterize the rate loss due to not knowing the state at the uninformed 
encoders; and help assess the effectiveness of the coding schemes that are employed for the 
achievability results. In this paper, we study a state-dependent relay channel with the channel 
state known to only the relay. This model is conceptually different from the model considered 
previously in [18], [19] in which the channel state is noncausally known to only the source. 

B. Motivation 

Channels whose probabilistic input-output relationship depends on random parameters, or 
channel states, can model a large variety of scenarios. The assumption of noncausal channel 
state can hold naturally or approximately. Examples where the assumption of noncausal state 
holds naturally include information embedding [23]-[28], certain storage applications such as 
computer memories with defective cells [29] and certain broadcast scenarios such as multiple- 
input multiple-output (MIMO) broadcast channels [30]-[32] where DPC is a central ingredient 
in achieving the capacity region [33]. Examples where the assumption of noncausal state holds 
approximately include dispersive (IS I) channels [5], block fading in wireless environments [34], 
network [35] and cooperative networks [36]. 

Yet, another example application is cooperation in the realm of cognition. Driven by the 
growing demand for frequency spectrum, cognitive radios, usually defined as smart radio devices 
that are capable of acquiring some knowledge about the channel state, are introduced into 
communication systems in order to help non-cognitive radios in terms of spectral efficiency 
[37]. In a wireless interference network in which some terminals compete and some others 
cooperate, equipping some specific terminals with cognition capabilities that allow them to learn 
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the interference to high accuracy would help other non-cognitive terminals. These cognitive 
radios can exploit the knowledge of the interference or channel state to remove its effect on the 
transmission of their own messages and also that of the messages of the non-cognitive terminals 
as well. The study of fundamental performance limits of models with only a subset of the 
encoders being informed is relevant for a better understanding of communication systems that 
involve cognitive radios. For example, to increase system spectral efficiency, collaboration is 
investigated in the realm of cognition in [38]-[40]. Also, the problem of collaborative signal 
transmission in the presence of some cognizant terminals is investigated for a MAC scenario 
in [13], [16] and for an interference channel scenario in [41]-[44]. The setup we consider in 
this paper also models the building block for collaborative wireless networks in which only the 
relays, but neither sources nor destinations, are cognizant of the channel state. An example of 
such a scenario is shown in Figure [2} 




Fig. 2. Example wireless network with cognition capabilities. If the relay T2 is cognizant of the competing source To, it can 
help the source Ti cancel the effect of the interference from To. 



C. Main Contributions 

For the DM case, we derive lower and upper bounds on the capacity of the general state- 
dependent relay channel with informed relay. The lower bound is obtained by a coding scheme 
at the relay that uses a combination of codeword splitting, Gel'fand-Pinsker coding, and decode- 
and-forward (DF) relaying. For this model, designing a codebook at the relay is challenging 
since such a codebook should allow the source to generate codewords that are correlated with 
the channel input of the relay which exploits the available channel state. In this work, this is 

November 24, 2009 DRAFT 



6 



accomplished by codeword splitting at the relay. With codeword splitting, the channel input of 
the relay is generated from two codewords: the first of which is a function of the cooperative 
information (i.e, the information that is sent cooperatively by the source and the relay using 
a joint codebook) and the channel state, and the second of which is a function of only the 
cooperative information. Since the source knows the cooperative information, it can generate its 
channel input in a way such that it is correlated with the latter codeword at the relay, which is 
a function of only the cooperative information. 

Our upper bound on the capacity is tighter than that obtained by assuming that the channel 
state is also available at the source and the destination. This upper bound is non-trivial and 
relates to the bounding technique developed in the context of multiple access channels with 
asymmetric channel state in [16, Theorem 2]; however, we note that the present upper bound is 
proved using techniques that are different from those in [16]. On a related note, we mention that 
at a high level there is a connection between the multiple access transmission part in the RC 
with informed relay in this work and the models in [13], [16]. However, there are also numerous 
conceptual differences that will be discussed whenever relevant. In particular, in contrast to [13], 
[16], here the uninformed encoder (the source) knows the message of the informed encoder (the 
relay). From this angle, the model in this paper connects more with the state-dependent MAC 
studied in [21]. 

Furthermore, we specialize the results to the case in which the channel is degraded. Also, we 
extend the lower bound for the DF relaying scheme to the case in which the source implements 
rate- splitting and the relay decodes-and-forwards only one part of the source message. 

We apply the concepts developed in the DM case to the Gaussian case in which both the 
noise and the state are additive Gaussian random variables. In our analysis for the Gaussian RC, 
we first allow the relay to operate in a full-duplex mode in which it can transmit and receive 
simultaneously, and then we constrain it to operate in a half-duplex mode in which it can either 
only transmit or only receive. 

In the case of full-duplex transmission, we derive lower and upper bounds on the capacity 
of the Gaussian relay channel with informed relay. We obtain two lower bounds by using the 
concepts of codeword splitting, generalized DPC (GDPC) [10], [45], and decode-and-forward 
relaying. Through codeword splitting, the channel input of the source is partially coherent with 
the channel input of the relay. The first lower bound uses full DF at the relay and the second 
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further enlarges it by allowing rate-splitting at the source. 

We also point out the loss incurred by the availability of the channel state at only the relay 
in the upper bound. We show that the lower bound obtained with rate splitting at the source is 
in general close to the upper bound for general Gaussian channels. In the case of the degraded 
Gaussian channel, the two lower bounds meet and they meet with the upper bound for some 
special cases. 

In the case of half-duplex transmission, we derive lower and upper bounds for the capacity 
of the Gaussian relay channel with informed relay. In this case, we focus on relaying protocols 
in which the relay either fully or partially decodes the source message, re-encodes and sends it 
to the destination. 

D. Outline and Notation 

An outline of the remainder of this paper is as follows. Section |n] describes in mode detail 
the communication model that we consider in this work. Section [HI] provides lower and upper 



bounds on the capacity of the general discrete memoryless RC with informed relay. Section IV 
provides lower and upper bounds on the capacity of the Gaussian RC with informed relay, and 
also contains some numerical results and discussions. Finally, Section [V] concludes the paper. 

We use the following notations throughout the paper. Upper case letters are used to denote 
random variables, e.g., X; lower case letters are used to denote realizations of random variables, 
e.g., x; and calligraphic letters designate alphabets, i.e., X. The probability distribution of a 
random variable X is denoted by Px{x). Sometimes, for convenience, we write it as Px- We 
use the notation E_y[-] to denote the expectation of random variable X. A probability distribution 
of a random variable Y given X is denoted by Py\x- The set of probability distributions defined 
on an alphabet X is denoted by !P(X). The cardinality of a set X is denoted by |X|. The short- 
hand notation X{ indicates a sequence of random variables (Xi,X i+1 , ■ ■ ■ ,Xj) and x\ denotes 
a particular realization of a random sequence X\ . For convenience, the length n vector x n will 
occasionally be denoted in boldface notation x. Given random variables X, Y, Z, we denote 
the entropy of X by H(X), the mutual information between X and Y by I(X;Y), and the 
conditional mutual information between X and Y, conditioned on Z, by I(X;Y\Z) [46]. The 
Gaussian distribution with mean \i and variance a 2 is denoted by a 2 ). Finally, throughout 
the paper, logarithms are taken to base 2, and the complement to unity of a scalar u G [0, 1] is 
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denoted by u, i.e., u—l — u. 

II. System Model and Definitions 
In this section, we formally present our communication model and the definitions related to 
it. As shown in Figure [TJ we consider a state-dependent relay channel denoted by Wy 2 ,y 3 \x 1 .x 2 ,s 
whose outputs Y 2 E V 2 and Y 3 E V 3 for the relay and the destination, respectively, are controlled 
by the channel inputs Xi E Xi from the source and X 2 E X 2 from the relay, along with a 
channel state S E §. It is assumed that the channel state Si at time instant % is independently 
drawn from a given distribution Qs and the channel state S n is noncausally known at the relay. 
Also, each transmitted input block from the source and each transmitted input block x 2 from 
the relay are subject to additive normalized input constraints 

n 

1=1 

where cp% : Xi — > IR + and <^ 2 : X2 — >• M + are single-letter input cost functions for the source 
and the relay, respectively. 

The source wants to transmit a message W to the destination with the help of the relay, 
in n channel uses. The message W is assumed to be uniformly distributed over the set W = 
{1, • • • , M}. The information rate R is defined as logM/n bits per transmission. 

An (n, M, Ti, T 2 ) code for the state-dependent relay channel with informed relay consists of 
an encoding function at the source 

{!,-■■ ,M}^ XI 
a sequence of encoding functions at the relay 

02,: V^xS^Xs, 
for i = 1, 2, . . . , n, and a decoding function at the destination 

r- vs-{i.-",m} 

such that 

n 
8=1 
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and 

1 n 

fox we {l,-- ,M}. 

From an (n, M, r 1? r 2 ) code, the sequences X™ and from the source and the relay, 
respectively, are transmitted across a state-dependent relay channel modeled as a memory- 
less conditional probability distribution W Y2 ,y 3 \x 1 ,x 2 ,s- The joint probability mass function on 
WxS n xX?xX2xy™xy™ is given by 

n 

P(w, s n , x n u xl yl = P(w) J] Qs(s i )P(x 1 , i \w)P(x 2ti \s n , y^ 1 ) 

i=i 

•WV 2> y 3 |x 1 ,j: 2) s(2/i2,i, S/3,i|a:i,i, afy, s^. (2) 

The channel is said to be physically degraded if the conditional distribution W Y2 ,y 3 \x 1: x 2 ,s 
factorizes as 

WV 2 ,r 3 |Xi,x 2 ,5 = Wy 2 \x u x 2 ,sWy 3 \y 2 ,x 2 ,s- (3) 

The destination estimates the message sent by the source from the channel output Yg. The 
average probability of error is defined as P™ = Pi[ip n (Y^ 1 ) ^ W\. 

An (e, n, P, Ti, T 2 ) code for the state-dependent RC with informed relay is an (n, ^2 nR ~\ , Ti, T 2 ) 
code (0™,02)^ n ) having average probability of error P™ not exceeding e. 

Given a pair T = (r 1? r 2 ), a rate R is said to be T-achievable if there exists a sequence of 
(e n , n, P, Ti, r 2 )-codes with Hindoo e n = 0. The capacity C(T) of the state-dependent RC with 
informed relay is the supremum of the set of T-achievable rates. 

III. The Discrete Memoryless RC with Informed Relay 

In this section, we assume that all the alphabets in the model, S, Xi, X 2 , ^ 2 and y 3 , are 
discrete and finite. 

A. Lower Bound on Capacity 

The following theorem provides a lower bound on the capacity of the state-dependent DM 
RC with informed relay. 
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Theorem 1: Let T = (ri,r 2 ) be given. The capacity C(T) of the state-dependent DM RC 
with informed relay satisfies C(T) > i? lo (r), where 

i? lo (r) = maxmm{l(X 1 ;Y 2 \S,U 1 ,X 2 ), 

I(X 1 ,U l ,U 2 ;Y 3 )-I(U 2 ;S\U 1 )}, (4) 

with the maximization over all probability distributions of the form 

Ps,Ui,U 2 ,X lt X 2 ,Y 2 ,Y 3 — 

QsPm Px x i u x Pu 2 \Ux,sPx 2 \Ux,u 2 ,sWy 2 ,y 3 \Xx ,x 2 ,s (5) 

and satisfying E[<^j(X;)] < Tj, i = 1,2, and U\ E Hi, U 2 E U 2 are auxiliary random variables 
with 

| Ui| < |S||Xi||X 2 | + 1 (6a) 
\U 2 \ < (|§||Xi||X 2 | + l)|S||Xi||X 2 |, (6b) 

respectively. 

Remark 1: The lower bound Q is based upon a technique at the relay we call codeword 
splitting, combining decode-and-forward (DF) relaying [47, Theorem 1] with Gel'fand-Pinsker 
coding [2]. In conventional DF strategies, the source knows the relay input, allowing the source 
and relay to utilize a joint codebook to transmit cooperative information. However, in our model 
there is a tension between the utility of a joint codebook for relaying and the utility of the 
relay's making use of the channel state, which is unknown to the source. To resolve this tension, 
we generate two codebooks at the relay. In one codebook, the codewords are generated using a 
random variable U\ that is independent of the channel state S. The relay chooses the appropriate 
codeword from this codebook using only the cooperative information. In the other codebook, 
the codewords are generated using a random variable U 2 that is correlated with the channel state 
S and the variable U\ through Pu 2 \Ux,s- The relay chooses the appropriate codeword from this 
codebook using both the cooperative information and the channel state, in order to combat the 
effect of the channel state on the communication. Finally, the relay generates the channel input 
X£ from (Ui,U 2 ) using the conditional probability law Px 2 \Ux,u 2 ,s- The source knows U" as 
this is a function of only the cooperative information, and, given C/f, it generates the random 
codeword X™ according to the conditional probability law Pxx\u x - Thus, the channel inputs of the 
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source and the relay are correlated through [/". A dependence diagram of the random variables 
that are involved in the coding scheme is shown in Figure [3j 
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x 2 \u u u 2 ,s 



X 2 
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P X!\U! 










■x 1 



Fig. 3. Dependence diagram of the random variables for the lower bound in Theorem [T] 



Remark 2: The term [I(Xi, Ui, U2; Y3)— l{Ui\ S\JJ\f\ in can be interpreted as an achievable 
sum rate over a state-dependent MAC with one informed encoder and degraded messages, i.e., 
one common and one individual message. In our model, the informed encoder sends only 
the common message, i.e., the cooperative information of DF relaying, and the uninformed 
encoder sends both the common and individual messages. By contrast, [13], [16] derive the 
capacity region for the reverse situation in which the informed encoder sends both the common 
and individual messages, and the uninformed encoder sends only the common message. This 
swapping of roles makes coding at the relay more involved than in [13], [16] for the state- 
dependent MAC and the [18], [19] for the related state-dependent RC with informed source. As 
we mentioned earlier, a MAC model that has closer connection with the model in this paper is 
investigated in [21]. This model is obtained by swapping the roles of the encoders in [13], [16]. 
Outline of Proof of Theorem 3} 

First we generate a random codebook that we use to obtain the lower bound in Theorem [T] 
Next, we outline the encoding and decoding procedures at the source and the relay. The coding 
scheme is based on a combination of codeword splitting, regular-encoding backward decoding 
for DF [48], and a variation of Gel'fand-Pinsker binning. A formal proof with complete error 
analysis is given in Appendix |A} In the formal proof we also show that the input constraints are 
satisfied. 
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Codebook generation: Fix a measure Ps,Ui,U2,Xi,X2,Y2,y 3 satisfying ([5]) and E[<^j(Xj)] < r i; 
i = 1,2. Fix e > and denote 



M = 2 n( - R ~ At) . (7b) 



1. We generate M independent and identically distributed (i.i.d.) codewords {u\(w')} indexed 
by w' = 1, . . . , M, each with i.i.d. components drawn according to Pjj 1 . 

2. For each codeword ui(w'), we generate M i.i.d. codewords {x. 1 (w',w)} at the source 
indexed by w = 1, . . . , M, and J auxiliary codewords {u 2 (u>', j)} at the relay indexed 
by j = 1, . . . , J. The codewords x 1 (w / , u>) and u 2 («/, j) are with i.i.d. components given 
Ui(w') drawn according to Pxi|J7i an d -P^ic/x, respectively. 

Outline of the coding scheme: We outline the coding scheme in the following. The message 
W to be sent from the source node is divided into B blocks wi, w^, ■ ■ ■ , wb of nR bits each. 
For convenience we let w B+1 = 1. The transmission is performed in B + 1 blocks. We denote 
by s[i] the channel state in block i, i = 1, . . . , B + 1. 

Continuing with the strategy, in the first block, the source transmits xi(l, Wi). The relay searches 
for the smallest j E {1, ••• , J} such that Ui(l), u 2 (l,j) and s[l] are jointly typical (the 
properties of strongly typical sequences guarantee that there exists one such j). Denote this 
j by j* = j(s[l],l). Then, the relay transmits a vector x 2 (l) with i.i.d. components given 
(ui(l),u 2 (l, j*), s[l]) drawn according to the marginal Px 2 \u 1 ,u 2 ,s induced by the distribution 
©• 

The decoder at the relay uses joint typicality. It declares that message w\ is sent if there is 
a unique w\ such that xi(l,wi) is jointly typical with (y 2 [l],s[l]) given ui(l), u 2 (l, j*) and 
x 2 (l), where y 2 [l] denotes the information received at the relay in block 1. One can show that 
the relay can decode reliably as long as n is large and 

R<I(X l ;Y 2 \S,U l ,X 2 ). (8) 

So, suppose the relay correctly obtains w 1 . In the second block, the source transmits x 1 (w 1 , w 2 ) 
and the relay transmits a vector x 2 (wi) with i.i.d. components given u 1 (u' 1 ), u 2 (wi, j(s[2],wi)), 
s[2] drawn according to the marginal Px 2 \Ui,u 2 ,s', tne sequence u 2 (^i, j(s[2], ^i)) is chosen 
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such that j(s[2],wi) is the smallest j e {1, • • • , J} satisfying Ui(wi), u 2 (w 1 ,j) and s[2] are 
jointly typical. Upon observation of y 2 [2], the decoder at the relay declares that w) 2 is sent 
if there is a unique w 2 such that xi(u>i, ty 2 ) is jointly typical with (y 2 [2], s[2]) given ui(iui), 
u 2 (wi, j(s[2], u>i)) and x 2 (wi). Again, it can decode reliably as long as n is large and ([8]) is 
true. At the relay, one continues in this way until block B + 1. 

Consider now the destination, and let y3[i] be the received information at the destination in 
block i. Suppose these information are collected until the last block of transmission is completed. 
The destination can then perform Willem's backward decoding [48], by first decoding w B from 
y 3 [B + 1]. Note that y 3 [B + 1] depends on xi(w B , 1), Ui(w B ) and u 2 (w B ,j(s[B + l],w B )), 
which in turn depends only onw B . The decoder at the destination uses joint typicality. It declares 
that w B is sent if there is a unique w B such that xi(ws, 1), ui(w B ), u 2 (w B ,j B ), y 3 [B + 1] are 
jointly typical, for some index j B G {1, . . . , J}. One can show that the destination can decode 
reliably as long as n is large and 

R < I(X h U h U 2 ; Y 3 ) - I(U 2 ; S\U X ). (9) 

So, suppose the destination correctly obtains w B . Next, the destination decodes w B -i from ys[fi], 
which depends on ■x.i(wb-i,Wb), Ui(wb-i) and u 2 (%_i, j(s[B), wb-x}). Since the destination 
knows Wb, it can again decode reliably as long as n is large and (|9]) is true. At the destination, 
one continues in this fashion until all message blocks have been decoded. The average rate over 
the B + 1 blocks is RB/(B + 1) bits per use, and by making B large one can get the rate as 
close to R as desired. 

Remark 3: In the case of classic RC without state, one can consider three different decode-and- 
forward strategies: irregular encoding successive decoding [47], regular encoding sliding-window 
decoding [49] and regular encoding backward decoding [48]. It is well known that these three 
strategies achieve the same rate in this case [50]. In the state-dependent case with informed 
relay, one can show that backward decoding achieves rates higher than those of sliding-window 
decoding. More precisely, sliding window decoding at the destination at the end of block i is as 
follows (we use the notation in the proof of Theorem [T]). The destination knows w^ 2 and also 
the correct index j(s[i — l],u>j_ 2 ), and decodes u>j_i based on the information received in the 
two adjacent blocks % — \ and i. It declares that the message is sent if there is a unique pair 
(wi-iji-i) such that the vectors u^w^), u 2 (wi- 2 , j(s[i - l],u>i_ 2 )), xi(iUi_ 2 , Wi-i), y 3 [i - 1] 
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are jointly typical, and the vectors u 1 (u) i _ 1 ), u 2 (wi-i, ji-i), y3^] are jointly typical. Thus, the 
destination obtains the message if 

R < I(X U U u U 2 ; Y 3 ) - I(U 2 ; S\U X ) (10) 
I(U 2 ;Y 3 \U 1 )-I(U 2 ;S\U 1 )>0. (11) 

Hence, with window decoding also, the achievable rate is obtained by maximizing the RHS of 
([4]). However, unlike the above backward decoding scheme, the maximization is over a set of 
distributions of the form ([5]) that satisfy the constraint ( [TTj ). Because of the additional constraint, 



this set is smaller than the one used in Theorem [TJ Informally speaking, the additional constraint 



( [TTj ) guarantees that, in the decoding of the vectors Ui and u 2 , the destination can actually 
decode the vector u 2 fully, i.e., it can determine not only the bin index (i.e., the message u>j_i) 
but also the correct sequence in the bin (i.e., the index j(s[z], Wi-\)). | 

The achievable rate in requires the relay to fully decode the message sent by the source, 
and this can be rather a severe constraint. We can generalize Theorem [TJ by allowing the relay to 
decode the source message only partially [51]. This can be done by implementing rate-splitting 
at the source [52] and introducing a new random variable U that represents the information 
decoded by the relay. The following corollary gives the resulting rate. 

Corollary 1: The capacity C(T) of the state-dependent DM RC with informed relay satisfies 
C(T) > R' ]0 (T), where 

R ,lo (T) = maxmin ^(U-^IS,^, X 2 ) + I(X 1 ;Y 3 \U,U h U 2 ) 

+ min{0, 1(U 2 ; Y 3 \U, U x ) - I(U 2 ; S\U{)}, I(X U U 1} U 2 ; Y 3 ) - I(U 2 ; S\U X )}, 

(12) 

with the maximization over all probability distributions of the form 

Ps,U 1 ,U 2 ,U,X 1 ,X 2 ,Y 2 ,Y 3 = 

QsPu! Pu\Ux Px 1 \Ui,uPu 2 \Ui ,sPx 2 \Ui ,U 2 ,sWy 2 ,y 3 |Xi ,x 2 ,s (13) 

and satisfying E^pQ)] < Tj, i = 1,2, and U\ € Ui , U 2 G U 2 , U E U are auxiliary random 
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variables with 

|Ui| < |S||Xi||X 2 | + 2 (14a) 
|U 2 | < (jS||Xi||X 2 | +2)|S||Xi||X 2 | +2 (14b) 
|U| < (jS||Xi||X 2 | + 2)|S||Xi||X 2 | +2, (14c) 

respectively. 

The proof of Corollary [T] is similar to that of Theorem [T] and, hence, only an outline of it is 
given in Appendix [Bj For instance, the particular choice U = Xi in Corollary [T] gives the lower 
bound in Theorem [T] 

An informal interpretation of the rate ( fT2| ) for the case in which [I(U 2 ; Y 3 \U, U\)—I{ U 2 ; S\Ui)] > 
is as follows. Since I(U ; Y 3 \Ux, U 2 , Xi) = for the distribution considered in ( [T3| ), the second 



term of the minimization in (12) can be written as 



I(U, U u U 2 ; Y 3 ) - I(U 2] S|fA) + I{X X - Y 3 \U, U x , U 2 ). 

The rate p2] ) can then be interpreted as the rate achievable if the message W transmitted by 
the source is split into two independent parts, one of which is transmitted through the relay, say 
at rate R r , and the other is transmitted directly to the destination without the help of the relay, 



say at rate Rd. The total rate is R = R r + Rd- In ( [12] ) the auxiliary variable U stands for the 
information decoded by the relay and plays the role of X% in Theorem [T] Thus, it follows from 
([4]) that the message transmitted through the relay can be decoded correctly at the destination if 
rate R r satisfies 

Rr < mm{l(U;Y 2 \S,U 1 ,X 2 )J(U,U 1 ,U 2 ;Y 3 ) - I^Sp!)}. (15) 

It can also be easily argued (see Appendix [B]) that the additional information, which is sent on 
top of the information transmitted through the relay, can be decoded correctly at the destination 
if rate R d satisfies 

R d < I(X 1 ;Y 3 \U,U 1} U 2 ). (16) 



This shows that message W can be sent at the rate ( |T2| ). 

Remark 4: The relay can employ other relaying schemes to assist the source, such as estimate- 
and-forward [47], amplify-and-forward [53]-[55] or combinations of these schemes. However, 
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none of these schemes achieves capacity even if the channel is state-independent. Hence, though 
some of these schemes may perform well in terms of achievable rates for some particular 
channels, we do not focus on these schemes in this paper. 

B. Upper Bound on Capacity 

The following theorem provides an upper bound on the capacity of the state-dependent DM 
RC with informed relay. 

Theorem 2: Let V = (ri,r 2 ) be given. The capacity C(T) of the state-dependent DM RC 
with informed relay satisfies C(T) < R up (T), where 

i? up (r) = maxmia^I{X 1 ;Y 2 ,Y 3 \S,X 2 ), 

I(X u X 2 ',Y 3 \S)-I(Xr,S\Y 3 )} (17) 

with the maximization over all probability distributions of the form 

Ps,Xi,x 2 ,y 2 ,y 3 = QsPx 1 Px 2 \x 1 ,sWy 2 ,y 3 \x x ,x 2 ,s (18) 

and satisfying E[<^(Xj)] < Tj, i = 1,2. 

The proof of Theorem [2] appears in Appendix [C] 

In the second term of the minimum in ( fTTj ), I{X 1 \ S\Y 3 ) can be interpreted as the rate penalty 
caused by the source's not knowing the channel state. This rate loss makes the above upper 
bound tighter than the trivial upper bound obtained by assuming that the channel state is also 
available at the source and the destination, i.e., the cut set upper bound 

RZ(T) = maxmm{l(X 1 ;Y 2 ,Y 3 \S,X 2 ),I(X 1 ,X 2 ;Y 3 \S)} (19) 

with the maximization over all distributions of the form 

PsxiX2,Y2,Yz — QsPx 1 \sPx 2 \x 1 ,sWy 2 ,y 3 \x 1 ,X2,s (20) 

and satisfying E[^(Xj)] < Tj, i = 1,2. 

If the channel is physically degraded, the upper bound in Theorem [2] reduces to the one in 
the following corollary. 
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Corollary 2: The capacity of the state-dependent physically degraded RC with informed relay 
satisfies C D (T) < R^{T), where 

R^(T) = maxmm|j(X i; y 2 |S,X 2 ), 

I(X l ,X 2 ;Y 3 \S) - I(X 1 ;S\Y 3 )^ (21) 

with the maximization over all probability distributions of the form 

Ps,x 1 ,x 2 ,y 2 ,y 3 = QsPx 1 Px 2 \x 1 ,sWy 2 \x 1 ,X2,sWy 3 \y 2 ,X2,s (22) 

and satisfying E[</?i(Xj)] < Tj, i = 1,2. 

Similar to the general case in Theorem [2| the upper bound in Corollary [2] is tighter than the 



trivial upper bound in (19) for the degraded case 



IV. The Gaussian RC with Informed Relay 

In this section, we consider a state-dependent Gaussian RC in which both the channel state 
and the noise are additive and Gaussian. We also assume that the additive channel state is 
noncausally known to only the relay. First, we consider full-duplex transmission at the relay, 
i.e., the relay transmits and receives at the same time, and we derive lower and upper bounds on 
channel capacity for this case. Then we extend these results to the half-duplex mode in which 
the relay is constrained to operate in a time-division (TD) manner. 



A. Full-Duplex Channel Model 

For the full-duplex state-dependent Gaussian RC, the channel outputs Y 2>i and Y 3ji at time 
instant i for the relay and the destination, respectively, are related to the channel input X\^ from 
the source and X 2ji from the relay, and the channel state Si by 

Y 2ii = X M + Si + Z 2;i (23a) 
Y 3ii = X hi + X 2ii + Si + Z 34 , (23b) 

where S% is a zero mean Gaussian random variable with variance Q, Z 2) i is zero mean Gaussian 
with variance N 2 , and Z 3 ti is zero mean Gaussian with variance N 3 . The random variables Si, 
Z 2ji and Z 3ii at time instant i E {1,2, ... ,n} are mutually independent, and are independent of 
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(Sj, Z 2 j, Z 3 j) for j 7^ i. The random variables Z 2)i and Z 3 j are also independent of the channel 
inputs (X?,X%). 

For the full-duplex degraded additive Gaussian RC, the channel outputs F 2 ,« and Y 3 yi for the 
relay and the destination, respectively, are related to the channel inputs Xl^ and X 2; i and the 
state Si by 

Y 2>i = X lti + Si + Z 2ji (24a) 

Y 3ti = X 2>i + Y 2ti + Z' 3ti , (24b) 

where (Z' 3 1; • • • , Z 3 n ) is a sequence of i.i.d. zero mean Gaussian random variables with variance 
N 3 = N 3 — N 2 which is independent of Z 2 . 

The channel inputs from the source and the relay should satisfy the following average power 
constraints, 

n n 
1=1 8=1 

As we indicated previously, we assume that the channel state S n is noncausally known at only 
the relay. The definition of a code for this channel is the same as that given in Section |TTJ. with 
the additional constraint that the channel inputs should satisfy the power constraints ([25]). 



B. Lower Bounds on Capacity 

In this section, we derive lower bounds on the capacity of the state-dependent full-duplex 
Gaussian RC with informed relay. The results obtained in Section [III] for the DM case can be 
extended to memoryless channels with discrete time and continuous alphabets using standard 
techniques [56, Chapter 7]. 

The following theorem provides a lower bound on the capacity of the state-dependent full- 
duplex Gaussian RC with informed relay. 

Theorem 3: The capacity C G of the state-dependent Gaussian RC with informed relay satisfies 
C G > where 

J$ = mpmin(jlog(l + ^^), 

Pl2 I Z iV 2 

P 1 + 0P 2 + 2p' 12V / 6P 1 P 2 \ 1 . , 0P 2 (l-(% 



max 



1, f, , Pi + eP 2 + 2p' 12 ^6P l P 2 x 1 

ix - loff 1 H ; H — log 1 , 

p r 2& 2 H eP 2 + Q + N 3 + 2p' 2s ^eP 2 QJ 2 61 iV 3 
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with the maximization over parameters p' l2 E [0,1], 6 E [0, 1], and p' 2s E [— 1, 0]. 
Proof: A formal proof of Theorem [3] is given in Appendix [Dj 
Outline of Proof: 

• We compute the lower bound @ for an appropriate choice of the input distribution that 
will be specified in the sequel. By extension, Remark [T] also applies for the Gaussian case. 
More specifically, we should consider two important features in the design of an efficient 
coding scheme at the relay: obtaining correlation or coherence between the channel inputs 
from the source and the relay, and exploiting the channel state to remove the effect of 
the state on the communication. As we already mentioned, it is not obvious to accomplish 
these features because the channel state is not available at the source. Proceeding like for 
the code construction in the DM case, we split the relay input X 2 into two parts, namely 
[/" and X 2 . Furthermore, here we set C/f and X 2 to be independent. The first part, C/f , is a 
function of only the cooperative information, and is generated using standard coding. Since 
the source knows the cooperative information at the relay, it can generate its codeword X™ 
in such a way that it is coherent with U™, by allowing correlation between X r { and [/". 
The second part, X 2 , which is independent of the source input X™, is a function of both 
the cooperative information and the channel state S n at the relay, and is generated using a 
GDPC similar to that in [14], [16], [18]. 

• More formally, we decompose the relay input random variable X 2 as 

X 2 = U 1 +X 2 , (27) 

where: U% is zero mean Gaussian with variance 9P 2 , is independent of both X 2 and S, and 
is correlated with X x with E[[/iXi] = p' 12 \/6P 1 P 2 , for some 6 E [0, 1], p' 12 E [-1, 1] ; and 
X 2 is zero mean Gaussian with variance 0P 2 , is independent of X\, and is correlated with 



the channel state S with E^S 1 ] = p' 2s y/6P 2 Q, for some p' 2s E [—1, 1]. Expressed in terms 
of the covariances o\ 2 = E[X X X 2 ] = K[XiUi] and a 2s = E[X 2 S] = E[X 2 S], the parameters 

Pi2> Pzs ^ g iven b Y 

Pl2 ~7mE' p2 '~vmQ- <28) 

For the GDPC, we choose the auxiliary random variable U 2 as 

U 2 = X 2 + a opt S (29) 
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with 



3 



a ° pt " 6P 2 (1- P g) + N 3 ■ (30) 
Similarly to in the DM case, we can generalize Theorem [3] by allowing the relay to decode 
the source message only partially, through rate-splitting at the source. The following corollary 
gives the resulting rate. 

Corollary 3: The capacity Cq of the state-dependent Gaussian RC with informed relay satis- 
fies C G > R'g, where 

R*° = maxmin{ri,T 2 ,T 3 } (31) 

with 

Tl = 2 bg l 1 + N 2 + lPl ) + 2 l0g I 1 + N 3 + Ha>A P 'J (32) 

T _ 1 10 , A | 7A(i - P'llh i log / g(g + Q' + iP + jVjO 

2 2 ^\ ^ N 2 + 1 P 1 )^ 2 *\P&(l-a>)z + N 3 (P> + a'2Q') 



1, A P 1 + 6P 2 + 2p' 12V /6^P 1 P 2 \ 1 / P^ + Q' + iV, 



»3J 



2 to V 0P 2 + Q + N 3 + 2p' 2s ^mQ) 2 ^\P^Q'(l-a') 2 + N 3 (P^ + a' 2 Q' 

(34) 

P' 2 := 9P 2 (l-pg), Q' := {^Q + p' 2s V6P~ 2 ) 2 , p' 2s ) := ^J^f - ; and the maximization 

is over parameters 7 G [0, 1], G [0, 1], p' 12 G [0, 1], p' 2s G [-1,0], and a' G R such that the 
second logarithm terms in T 2 and T 3 are defined. 



Outline of Proof: An informal proof of Corollary [3] is as follows. We decompose the message 
W to be sent from the source into two independent parts W r and Wd- The message W r will be 
sent through the relay, at rate R r ; and the message Wd will be sent directly to the destination, at 
rate Rd- The total rate is R' G = R T + R d . The input Xf from the source is divided accordingly 
into two independent parts, i.e., X™ = U n + Xf, where U n carries message W r and has power 
constraint njPi and X™ carries message Wd and has power constraint wyPi, for some 7 G [0, 1]. 
The relay decodes and forwards only the part U n , and its input sequence is obtained in a manner 
which is similar to that in the coding scheme for Theorem [3] (with U n playing the role of X™ 
therein). 
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The rest of the proof follows by computing the lower bound in Corollary [T] using an input 
distribution and techniques that are essentially similar to those in the proof of Theorem [3j An 
outline of the important steps is given in Appendix |E| 

C. Upper Bound on Capacity 

The following theorem provides an upper bound on the capacity of the state-dependent full- 
duplex general Gaussian RC with informed relay. 

Theorem 4: The capacity C G of the state-dependent general Gaussian RC with informed relay 
satisfies C G < R G P , where 



-axmm{ilo g (l + W - r ^ i: )(^ + ^)), 

1 io g d + _ , m+^m _ ) + 1 io g (i + ^ - ^ - ^ 



2 °V p 2 (i_ p 2 2 _ p 2 s) + ( ^g + p2s ^2 + iV3 ; 2 °^ N 3 

(35) 

with the maximization over parameters p 12 G [0, 1] and p 2s G [—1,0] such that 

Pi2 + PL<1- (36) 
Proof: The proof of Theorem H is given in Appendix |fJ In the proof, we evaluate^ the upper 
bound ( fTTj ) using an appropriate joint distribution of S, Xi, X 2 ,Y 2 ,Y 3 . 

Following straightforwardly the proof of Theorem [4] in Appendix |FJ it can be easily shown that 
the capacity of the state-dependent degraded Gaussian RC is upper-bounded as in the following 
corollary. 

Corollary 4: The capacity Cdg of the state-dependent degraded Gaussian RC with informed 
relay satisfies C DG < -R D P G , where 



P2s 

1 log f 1 + ^ 5 ( ^ + ^ )2 ^ ^ + 1 logfl + P2(1 - ^ - Pl 



2 b V p 2 (i_ p 2 2 _ p 2 s) + (v ^ + p2sV ^ )2 + iV3 ; 2 Na 

(37) 

'in Theorem [4] if the maximizing p2 S in ([35} has absolute value equal to unity then (36} implies that p\2 is zero. In this 
case, and also in the rest of this paper, we use the convention that § = 0. 
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with the maximization over parameters p 12 G [0, 1] and p 2s G [— 1,0] such that 

P? 2 + pL<1- (38) 

D. Analysis of some Special Cases 

We note that comparing the above lower and upper bounds analytically can be tedious in the 
general case. In what follows, we identify a few cases in which the lower bound in Theorem[3]and 
the upper bound in Corollary [4] meet for degraded Gaussian channels, and some extreme cases 
for which the lower bound in Corollary [3] and the upper bound in Theorem [4] meet for general, 
i.e., not necessarily degraded, Gaussian channels; and so we obtain the capacity expression for 
these cases. 



In the following corollary we recast the lower bound (26) into an equivalent form by substi 



tuting Q12 = p[ 2 Vd an d Q2s = P2 S V^- Also, we recast the upper bound given in Corollary [4] into 



an equivalent form by substituting k = p\ 2 / yl — p\ s and p = p 2s . 

Corollary 5: For the Gaussian RC, the lower bound ( [26] ) in Theorem [3] can be written as 

1 , l+ (v^ + iWK)' + (fl-&)fl s + 1 lo + p 2 (i - S - A.) 



2 a \ P2(l-»-&) + (s/Q + t>2,^/Pl) 2 + N 3 > 2 m JV 3 

(39) 

where the maximization is over parameters 9 6 [0, 1], Qi 2 E [0, 1], g 2s G [—1, 0] such that 

q\ 2 + g 2 2s < 1. (40) 
For the physically degraded case, the upper bound in Corollary [4] can be written as 



R^ G = max min < - log (l + 

K 2 \ 



2 



1, . P 2 (l- K 2 (l- p 2 ) - p 2 ) 

max - log 1+ iii> 



p 2 ov N-i 



+ 2 g l + p 2 (l- K 2(l-^)) + Q + 2pV7W + ]V3/ / 



where the maximization is over parameters k G [0, 1] and p G [—1,0]. 
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By investigating the bounds in Theorem [3] and Corollary [4] and the equivalent expressions of 
these bounds in Corollary [5} it can be shown that the lower bound for the degraded case is tight 
for certain values of P\, P 2 , Q, N 2 , N 3 . The following observation states some cases for which 
the lower bound is tight. 

Observation 1: For the physically degraded Gaussian RC, we have: 

1) If Pi, P 2 , Q, N 2 , N 3 satisfy 

AI . P 1 N 3 (P 2 + Q + N 3 + 2CVP 2 Q) (A0 . 
Jy 2 > max — , (42) 

" ce[-i,o] PiA^s + P 2 (l - C 2 )(Pi +P 2 + Q + N 3 + 2Cv/iW) 
then channel capacity is given by 

C DG = ^log(l + ^), (43) 

which is the same as the interference-free capacity, i.e., the capacity if the channel state were 
not present in the model, or were also known to the source. 

2) If the maximizing p 12 and p 2s in the upper bound in Corollary [4] are such that condition 
([36]) is met with equality, i.e., p\ 2 + p\ s = 1, then the lower bound ( f39| ) is tight and gives the 
capacity. 

Proof: The proof of observation [T] appears in Appendix |Gj 

Remark 5: The condition in (|42]) specifies a range of values (Pi, P 2 , Q, N 2 , N 3 ) for which the 
lower bound for the degraded Gaussian case is tight. In this case, the capacity is the same as 
that of the degraded Gaussian RC with informed relay and informed source or interference-free 
capacity. Thus, the first statement in Observation [T] also provides a sufficient condition for the 
rate loss incurred by not knowing the interference at the source as well to be zero. At a high 



level, condition (j42J) means that there is no rate loss due to the asymmetry when capacity is 
constrained by the broadcast part in the model, i.e, transmission from the source to the relay and 



the destination. By investigating the upper bound (|4Tj) and comparing it with the interference- 
free capacity, it can be shown that this condition is also necessary. That is, the interference-free 



capacity is attained only if ( [42] ) is fulfilled. If the capacity of our model is constrained by the 
sum rate of the cooperative MAC part, i.e., the cooperative transmission from the source and 
the relay to the destination, the asymmetry resulting from not knowing the interference at the 
source as well causes an inevitable rate loss, i.e., the term J(Xi; S\Y 3 ) in Corollary [2] 
Extreme Cases 

We now summarize the behavior of the above bounds in some extreme cases. 
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1) Arbitrarily strong channel state: In the asymptotic case Q — > oo, the lower bound in 
Theorem [3] and the upper bound in Corollary [5] meet, thus yielding the capacity for degraded 
Gaussian RC 

C DG (Q = oo) = min{il g(l + ^), il og (l + ||)}. (44) 



Equation (j44j) suggests that traditional multi-hop transmission achieves the capacity in 
this case. A two-hop scheme allows to completely cancel the effect of the channel state 
by subtracting it out upon reception at the relay, and by applying standard DPC for 
transmission from the relay to the destination. 

For arbitrarily strong channel state and general, i.e., not necessarily degraded, Gaussian RC, 
the lower bound in Corollary [3] and the upper bound in Theorem [4] meet if P 2 N 2 < P\N 3 
or P 2 + N 3 < Pi, and capacity in these cases is given by 

C G (Q = oo) = Ilog(l + ^). (45) 

L 1\ 3 

It is interesting to note that if P 2 + N 3 < Pi the lower bound in Corollary [3] is maximized 
for a' = P 2 /(P 2 + N 3 ) and 7 = 1, meaning that direct transmission from the source to the 
relay is, not only possible, but also optimal in this case. The relay transmits independent 
information and decoding this information and subtracting it out at the destination, in a 
sense, clears the channel for the direct transmission. 
2) Deaf helper problem: In the case in which the relay is unable to hear the source (e.g., 
due to a very noisy or broken link source-to-relay) and Q — > oo, the lower bound in 
Corollary [3] and the upper bound in Theorem [4] meet if N 3 < \P X — P 2 \, giving 

C G (Q = 00, N 2 = 00) = \ log(l + min{ ^ 1,P2> ). (46) 

z i\ 3 

If Q, N 2 — ► 00 and iV 3 > \P 1 — P 2 \ the bounds do not meet. However, the lower bound 
is "within one bit" from the upper bound if P x + N 3 > P 2 , and it reaches it asymptotically 
in the power at the relay if P 2 + N 3 > Pi and P 2 3> iV 3 , i.e., 

R'g(Q = oo,N 2 = oo) = R u *(Q = 00, N 2 = 00) - o(l) 

= ilog(l + ^)-o(l) (47) 

where o(l) — > as P 9 — > 00. 
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3) For Q = 0, the lower bound in Corollary [3] reduces to the rate achievable using a partial 
decode-and-forward scheme in an interference-free relay channel, i.e., 

R G (Q = 0) = max min{i log(l 1 ~ " 



o<p' 12 ,7<i 2 N 2 + 1 P 1 



2 bg(1 + N^P, )} + 2 l0g(1 + A^' 

(48) 

and the upper bound in Theorem |4] reduces to the cut-set upper bound. Furthermore, if 
the channel is degraded these bounds meet and give the capacity of standard degraded 
Gaussian RC [47, Theorem 5]. 
4) If P 2 = 0, capacity for general Gaussian RC is given by 

Co(P 2 = 0)= 1 108(1 + ^). (49) 

E. Numerical Examples and Discussion 

In this section we discuss some numerical examples, for both the degraded Gaussian case and 
the general Gaussian case. We consider two numerical examples, a) Pi = P 2 = Q = 10 dB, 
iV 3 = 20 dB; and b)Pi = P 2 = Q = N 3 = 10 dB. 

Figure [4] illustrates the lower bound ([39]) and the upper bound ( |4T| ) as functions of the signal- 



to-noise-ratio (SNR) at the relay, i.e., SNR = P\/N 2 (in decibels), for a degraded channeQ Also 
shown for comparison are the cut-set upper bound p9] ) computed for the degraded Gaussian 
case and the trivial lower bound obtained by considering the channel state as an unknown noise 
and implementing full-DF at the relay [47, Theorem 5]. 

The curves show that the lower bound and the upper bound do not meet for all SNR regimes. 
However, as it is visible from the depicted numerical examples, the gap between the two bounds 
is small for the degraded case. Furthermore, the curves in Figure [4] also illustrate the results in 
observation [TJ by showing that the lower bound and the upper bound meet for the cases identified 
in Observation [TJ We note that the pentagram marker visible in Figure [4] indicates capacity when 



the noise at the relay is equal to the RHS of ( [42] ); and this illustrates the first case for which the 



lower bound and the upper bound meet in Proposition [TJ Also, Figure [5] depicts the variation of 

2 Note that for the full-duplex degraded Gaussian RC, the rate in Corollary j^J reduces to that in Theorem j^J 
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(b) 

Fig. 4. Lower and upper bounds on the capacity of the state-dependent degraded Gaussian RC with informed relay versus 
the SNR in the link source-to-relay, for two examples of numerical values (a) Pi = Pi = Q — 10 dB, N3 = 20 dB, and (b) 

Pi = P2 = Q = N :i = 10 dB. 



P12 + pL> where p i2 and p 2s are the maximizing for the upper bound, as a function of the SNR 
for the two numerical examples considered in Figure [4| and this illustrates the second case for 
which the lower and upper bounds meet in Proposition [TJ 

Figure [6] shows similar curves for the general Gaussian channel. The curves show that the 
lower bound ( |3Tj i is close to the upper bound ( [35] ) at large SNR, i.e., when capacity of the channel 
is determined by the sum rate of the MAC formed by transmission from the uninformed source 
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Fig. 5. The sum Pi2 + pL m the constraint l |36| >. Optimal pi2 and p2 S are the maximizing for the upper bound for the numerical 
examples considered in Figure [4] The upper subfigure is for the upper bound curve in Figure |4(a)| and the lower subfigure is 
for the upper bound curve in Figure |4(b)| 
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Fig. 6. Lower and upper bounds on the capacity of the state-dependent general Gaussian RC with informed relay and the 
maximizing 9, p'i2,P2s m {261 as functions of the SNR at the relay. Numerical values are Pi = P2 = Q — N3 — 10 dB. 



and the informed relay to the destination. At small SNR, the lower bound given in Corollary [3] 
improves upon that in Theorem [3] due to rate- splitting. 



Furthermore, Figure [6] also shows the variation of the maximizing 6, p' 12 , p^s m p6) as function 
of the SNR at the relay. This shows how the informed relay allocates its power among combating 
the interference for the source (related to the value of p' 2s ) and sending signals that are coherent 
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with the transmission from the source (related to the values of 9 and p' 12 ). 

Remark 6: In standard, i.e., state-independent, Gaussian relay channels, partial DF simply 
reduces to direct transmission if the link source-to-relay is too noisy, i.e, at low SNR. For the 
studied model, however, it is insightful to observe that the relay can still help the source even 
at very small SNR. This can be seen by observing that the lower bound ( f3T| ) is better than the 
trivial lower bound even at this range of SNR (the trivial lower bound in Figure [6] is obtained by 
treating the channel state as additional noise and implementing partial DF). This observation has 
some connection with the aforementioned deaf helper problem (see Case 2, Section "Extreme 
Cases"), and it can be interpreted as follows. The relay does not hear the source and generates 
its input X 2j i using a dummy DPC as X 2 = U 2 — S, where X 2 ~ N(0, P 2 ) is independent of S 
and U 2 is Costa's auxiliary random variable. Upon reception of Y 3ti = X\^ + X 2 ,% + Si + Z 3ti at 
the destination, the decoder first decodes the codeword U 2> i fully, i.e., not only the bin index but 
also the correct sequence in the bin. This can be done reliably as long as I(U 2 ; Kj) — (U 2 ; S) > 0. 
Then, the decoder at the destination subtracts out U 2> i from Y 3ji to obtain Y 3ji = Xy + Z 3)i from 
which it decodes the source's message using standard decoding, at full rate 0.51og(l + Pi/N 3 ). 
A related scenario for a helper over a state-dependent Gaussian MAC is studied in [17]. 

Remark 7: The gap between the lower bound and the upper bound which is visible at low 
SNR is due to that DF relaying (even partial) is not effective at small SNR and also to that 
our upper bounding technique is efficient on the MAC side but not on the BC side of the relay 
channel. 

In Figure [7J the lower and upper bounds are plotted as function of the interference power 
Q, for fixed value of the power at the relay and several choices of the power at the source. 
The curves are depicted for two examples of noise configuration: N 2 < N 3 (N 2 = 10 dB and 
N 3 = 20 dB), and N 2 > N 3 (N 2 = 20 dB and N 3 = 10 dB). The curves illustrate the discussion 
in the above extreme cases analysis. For instance, for both noise configurations, that the rate 
achievable for very large values of Q is strictly positive illustrates that transmission from the 
uninformed source to the uninformed destination is possible even in presence of an infinitely 
strong interference. Furthermore, the lower and upper bounds meet for the cases identified in 
the "Extreme Cases" Section, for both degraded Gaussian and General Gaussian channels. 
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Lower bound (Theorem 3) 
Upper bound (Corollary 4) 
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(a) Degraded Gaussian RC. P 2 = N 2 = 10, iV 3 = 20 dB 



Lower bound (Theorem 3) 
Upper bound (Corollary 4) 
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(b) General Gaussian RC P 2 = N 3 = 10, N 2 = 20 dB 



Fig. 7. Bounds on channel capacity as function of the interference power Q. The curves correspond to different choices of 
power at the source: from bottom to top Pi = 5, 10, 15, 20, 25 dB. 
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F. Half-Duplex Channel Model 



In this section, we extend the results of Section IV-A to the case of half-duplex relaying, i.e., 
the relay can either transmit only or receive only. We consider a state-dependent Gaussian RC 
with informed relay, and we assume that the relay operates in a time-division (TD) relaying 
mode. In the TD mode, for a given time window, the relay is in the receive mode for a fraction 
of the given time and in the transmit mode for the remaining fraction of this time. Since the 
message from the source is transmitted to the destination in n channel uses, in the remaining of 
this section, we refer to the time indices from ^ 1 to \yn\ as the relay-receive period and the 
time indices from \yn\ + 1 to n as the relay -transmit period, for some v E [0,1]. Furthermore, 
to generalize the model, we assume that the channel state is zero mean Gaussian with 
variance Q^> during the relay-receive period, and the channel state is zero mean Gaussian 
with variance during the relay-transmit period. The channel output Y 2i at time instant i at 
the relay is given by 

Y2,i = X^ + + Z 2 ,i, 

during the relay-receive period, and is zero with probability one during the relay-transmit period. 
The channel output at time-instant i at the destination is given by 

= Xi] + + Z 3>i during the relay-receive period (50a) 
Yff = X{ 2 ) + X 2 ,i + Sf } + Z 3<1 during the relay-transmit period. (50b) 

Furthermore, the source has average power constraint P± during the relay-receive period and 
average power constraint during the relay-transmit period ; the relay has average power 
constraint P 2 . 

For fixed values of v, P^, P^ and P 2 , we have the following upper and lower bounds on 
the capacity of the state-dependent half-duplex Gaussian RC with informed relay. 

Proposition 1: The capacity of the state-dependent TD Gaussian RC with informed relay is 
upper-bounded by 

i^P(TD) = max min{i^ p , Rf} (51) 

3 For a scalar x, [x] stands for the largest integer small than or equal to x. 
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with 

>(2), 



p(l) 

i£ p = ^(p( 2 ),p 2) g(2) )Pl2;P2s ) + ^i og (i + ^ (52b) 



where $?(Pi, P 2 ,Q, P12, P2 S ) is defined as the second term of the minimization in (|33>, and the 
maximization is over parameters p 12 G [0, 1] and p 2s G [—1,0] such that p 2 2 + p 2s < 1. 

Proposition 2: The capacity of the state-dependent TD Gaussian RC with informed relay is 
lower-bounded by 

#g(TD) = max min{R l ° } P 2 °, R l °} (53) 



with 



P, / Pf 2) + 9P 2 + 2p' 12X /9Pi 2) P 2 

+ ^ log 1 



2 V ^p 2 + q(2) + 2p ' 2s y/0P 2 QW + N 3 

+ £ log ( ^2 + ^+^) \ (54c) 

where, maximization is over parameters 9 G [0, 1], p' 12 G [0, 1], p' 2s G [—1,0] and a' G R such 
that the last logarithm terms on the RHSs of ( |54b[ ) and ( |54c[ ) are defined; 



and Pi := M> 2 (1 - p?J, Q'< 2 > := (^/QW + p 2lV m) 



2 



The proofs of Proposition [TJ and Proposition [2] appear in Appendix |H| 

Remark 8: The coding scheme employed for the proof of Proposition [2] pre-assigns the time 
slots for the relay's receiving and transmitting modes. All the nodes then know ahead of time 
when the relay receives and when it transmits. This is relevant for nodes synchronization but 
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suboptimal in general for information rate. Instead, one can let the source and the relay choose the 
relay's mode and, so, in a sense, transmit additional information to the destination through that 
choice. This idea is introduced in [57] in the context of wireline and wireless networks without 
state and is called mode coding therein; see also [58, Section 4.3]. More specifically, let M denote 
a random variable that takes on values 1 ("receive") and 2 ("transmit") with probabilities v and 
v, respectively. Also, let us redefine the channel so as to include the relay's operating mode as 
Wy 2 ,y 3 \x u x 2 ,s,m; set X[ = (X u M), X' 2 = (X 2 , M), U> = (U, M), U[ = (U u M), U' 2 = (U 2 , M) 
and choose U = x[ 1} if M = 1 and U = if M = 2. Then, using (X[,X' 2 , U', U[, U' 2 ) in place 



of (Xl, X 2 , U, Ui, U 2 ) in ([12]), it can be shown that this yields a rate which is obtained by 
maximizing the minimum among R l 2 ° and R l g + I(M; F 3 ), i.e., larger than ( |53] ). However, 



as mentioned in [58, Section 4.3], the improvement is no larger than 1 bit per block and, also, 
harnessing it in practice requires some challenges in general. 

V. Conclusion 

In this paper, we consider a state-dependent relay channel with the channel state available 
noncausally at only the relay, i.e., neither at the source nor at the destination. We refer to this 
communication model as state-dependent RC with informed relay. This setup may model the 
basic building block for node cooperation over wireless networks in which some of the terminals 
may be equipped with cognition capabilities that enable estimating to high accuracy the states 
of the channel. 

We investigate this problem in the discrete memoryless (DM) case and in the Gaussian case, 
and we derive bounds on the channel capacity. For both cases, the upper bounds are tighter 
than those obtained by assuming that the channel state is also available at the source and the 
destination, and they help characterizing the rate loss due to the asymmetry, i.e., having the 
channel state available at the relay but not the source. Key to the development of the lower 
bounds is a coding scheme that splits the codeword at the informed relay into two parts: one 
part depends only on the cooperative information, not on the known channel state, and is used 
to enable coherent transmission from the source and the relay to the destination; another part 
is a function of both the cooperative information and the known channel state, and is used to 
combat the effects of the channel state on the communication through a generalized Gel'fand- 
Pinsker binning scheme. In the Gaussian case, we consider average power constraints at the 
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source and the relay and power allocation at the relay among the two parts of the code, allowing 
for a tradeoff between the coherence gain obtained through the coherent transmission and the 
mitigation of the channel state. 

Specializing the results to the case in which the channel is physically degraded, we show 
that the developed lower and upper bounds meet in some cases, thus characterizing the channel 
capacity. For the general Gaussian case, the bounds are in general close, but they meet only in 
some extreme cases. 

Appendix 

Throughout this section we denote the set of strongly jointly e-typical sequences [46, Chapter 
14.2] with respect to the distribution Px,y as T™(P X y)- 

A. Proof of Theorem [7] 

Consider the random coding scheme that we outlined in Section [inj We now give a formal 
description of the coding scheme and analyse the average probability of error. 
As we outlined after Theorem [T] we transmit in B + 1 blocks, each of length n. During each of 
the first B blocks, the source encodes a message Wi E [l,2 nR ] and sends it over the channel, 
where i = 1, . . . , B denotes the index of the block. For fixed n, the average rate R-jj^i over 
B + 1 blocks approaches R as B — ► +oo. 

Encoding: Let u>; be the new message to be sent from the source node at the beginning of 
block i, and w^i be the message sent in the previous block i — 1. At the beginning of block i, 
the relay has decoded the message u>j_i correctly and the source sends x.x(wi-i,Wi). The relay 
searches for the smallest j e {1, • • • , J} such that ui(u>j„i), u 2 (wj_i,j) and s[i] are jointly 
typical. Denote this j by j* = j(&[i],Wi-i). If such j* is not found, or if the observed state is 
not typical, an error is declared and j* is set to J. Then, the relay transmits a vector x 2 (wj_i) with 
i.i.d. components given (ui(u>j„i), u 2 (wi_i, j*), s[i]) drawn according to the marginal Px 3 \Ui,u 3 ,s 
induced by the distribution ([5]). 

The encoder at the source declares an error if the chosen codeword exceeds the power constraint, 
that is, <pi(xi(wi-.i,Wi)) > I\ + 7i(e) for some 71(e) > 0. Similarly, the encoder at the relay 
declares an error if </?g(x 2 (iUi-i)) > T 2 + 72(e), for some 72(e) > 0. 
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For convenience, we list the codewords at the source and the relay that are used for transmission 
in the first four blocks in Figure [8j 





block 1 


block 2 


block 3 


block 4 


Source codewords 


Xi(l,Uli) 


x 1 (w 1 ,w 2 ) 


*i(w 2 ,w 3 ) 


Xl(w 3 , 1) 


Relay codewords 


ur(l) 


Ui(«)i) 


Ui(«J 2 ) 


Ui(lU 3 ) 


u 2 (U(s[l],l)) 


u 2 (w 1 ,j(s[2],w 1 )) 


U2(t"2,j(s[3],tU 2 )) 




x 2 (l) 


x 2 (wi) 


X 2 (l« 2 ) 


x 2 (iu 3 ) 



Fig. 8. Regular encoding for DF for the state-dependent RC with informed relay. At the beginning of block 
i, the source transmits x\(wi^i,Wi) and the relay transmits a codeword X2(ui;_i) with i.i.d. components given 
(ui(wi_i),u 2 (wi_i,j(s[j],Wi_i)),s[j]) drawn according to the marginal Px 2 \u 1 ,u 2 ,s- 

Decoding: The decoding procedure at the relay is based on joint typicality. The decoding 
procedure at the destination is based on a combination of joint typicality and backward-decoding. 

1. At the end of block i, the relay knows w^i and declares that Wi is sent if there is a unique 
Wi such that xi(wj_i,Wj) and (y2[*],s[i]) are jointly typical given ui(wj_i), u 2 (u^_i, j*) 
and x 2 (u»i_i), where y 2 [i] denotes the output of the channel at the relay in block i and 
j* = j(s[i], u>j_i) as mentioned earlier. One can show that the decoding error in this step 
is small for sufficiently large n if 

R<I(X 1 ;Y 2 \S,U 1 ,X 2 ). (A-l) 

2. At the end of the transmission, the destination has collected all the blocks of channel 
outputs y 3 [l],y 3 [2], . . . ,y 3 [B + 1], and can then perform Willem's backward-decoding by 
first decoding wb from ys[B + 1]. 

First, the destination declares that wb is sent if there is a unique wb such that Ui(wb), 
u 2 (%, j B ), xi(w) B , 1), y 3 [i? + 1] are jointly typical, for some j B G {1, . . . , J}. One can 
show that the decoding error in this step is small for sufficiently large n if 

R < I(X 1 , U u U 2 ; Y 3 ) - I(U 2] Sp!). (A-2) 

Next, for b ranging from B to 2, the destination knows Wf, and decodes Wb-i based on 
the information received in block b. It declares that Wb-i is sent if there is a unique 
u>6_i such that Ui(-u) b _i), u 2 (wb_i, jb-i), ~x.i(wb-i,Wb), Y3[b] are jointly typical, for some 
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jb-i G {1, • • • , J}- One can show that the decoding error in this step is small for sufficiently 



large n if ( |A-2[ ) is true. 
Analysis of Probability of Error: 

Fix a probability distribution Ps^i^^x^x^^ satisfying (|5]) and and E[^(Xi)] < I\, i = 1, 2. 
Let s[i], Wi-x and Wi be the state sequence in block i, the message sent from the source node in 
block % — \ and the message sent in block i, respectively. As we already mentioned above, at the 
beginning of block i the source transmits xi(tUj_i, Wi) and the relay transmits a vector x 2 (u>j_i) 
with i.i.d. components conditionally given (ui(u>j_i), u 2 (u>i-i, j*), s[i\), with j* = j(s[i], i^-i), 
drawn according to the marginal Px 2 \Ui,U2,s- 

The average probability of error is such that 



Pr(Error) < ^ Pr(s)Pr(ui) 

(s,u 1 )^T|'(Q s F t/l ) 

+ Pr(s)Pr(ui)Pr(error|s,ui). (A-3) 

(s,ui) eif (QsPuJ 



The first term, Pr((s, Ui) ^ T^QsP^)), on the RHS of ( |A-3[ ) goes to zero as n — > oo, by the 
asymptotic equipartition property (AEP) [46, p. 384 ]. Thus, it is sufficient to upper bound the 
second term on the RHS of ( |A-3| ). 

We now examine the probabilities of the error events associated with the encoding and 
decoding procedures. The error event is contained in the union of the error events given below, 
where the events En, E 2 i and E 3i correspond to the encoding step at block i; the events 
and E 5i correspond to decoding at the relay at block i; the events Eqb and E 7 b correspond to 
decoding at the destination at block B + 1, and for b ranging from B to 2, the events £W&-i) 
and correspond to decoding at the destination at block b. 

• Let En be the event that there is no sequence u 2 (wi-i,j) jointly typical with s[i] given 
uiO;_i), i.e., 

E u = {$ j E {1,..., J} s.t. (u 1 (w^ 1 ),u 2 (w i - 1 ,j),s[i\^ G T?(P Uu u 2 , s )}. 

To bound the probability of the event Eu, we use a standard argument [2]. More specifically, 
for u 2 (u>i_i,j) and s[i] generated independently given ui(wj_i), with i.i.d. components 
drawn according to Pu 2 \u 1 an d Qs, respectively, the probability that u 2 (wj„!,j) is jointly 
typical with s[i] given Ui(iUi_i) is greater than (1 — ^2^ n( - I( - Ur ' s ^ UlS)+ ^ for sufficiently large n. 
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There is a total of J such u 2 's in each bin. The probability of the event E u , the probability 
that there is no such u 2 , is therefore bounded as 

Pr(£ii) < [1 - (1 - e )2-™( / (^;5|c/ 1 )+ e )]j_ (A . 4) 



Taking the logarithm on both sides of ( |A-4[ ) and substituting J using |7]) we obtain ln(Pr(£ , li )) < 



-(1 - e)2 ne . Thus, Pr(S M ) ^ as n -> oo. 

• Let P 2i be the event that the chosen codeword at the source, xi^j-i,^), exceeds the power 
constraint Y x by 71(e), 

E 2l = {tf( Xl (wi- ltV)i )) > Tt + 71(e)}. (A-5) 

By the weak law of large numbers, we have 

1 71 

Pr(E 2i ) =Pr(-^^>i(x M H) >r 1 + Tl (e)J 

i=i 

< e (A-6) 

for n large enough and E[</?i(Xl)] < IY 

• Let E 3i be the event that the chosen codeword at the relay, x 2 (wi_i), exceeds the power 
constraint T 2 by 72(e), 

E 3i = {^(x 2 (w < _ 1 )) > r 2 + 72 (e)}. (A-7) 

Using arguments similar to those for the event E 2i , we get Pr(E 3i \E^ i ) < e for n large 
enough and E[</? 2 (X 2 )] < T 2 , where E Xi denotes the event complement of E u . 

• Let En be the event that Xi(iOj_i, Wi), y2^], s[i] are not jointly typical given ui^-i), 
n 2 {w i ^ 1 ,f) and x 2 (wi_i), i.e., 

= {(ui(iB i _i),u 2 (tw i _i,j*),x 1 (iB i _i,iu i ),x 2 (iu i _i),y2[i],s[i]J £ 7r(P C / 1) [/ 2) x 1 ,x 2 ,y2 ) s)}- 

Conditioned on E 1 ^, P 2i , E 1 ^, we have that (s[i], ui(wj_i)) is jointly typical with u 2 (ii>j_i, j*) 
and with the source input xi(u>i_i, Wj) and the relay input x 2 (wj_i), i.e., 

^s[i],ui(«; i _i),U2(tt; i _i,j*),xi(iw i _i,tt; i ),X2(tt; i _i)J G (QsP^Px^P^^is^xJ. 

(A-8) 

For s[i], ui(wi_i), u 2 (wi_i, j*), xi(wj_i, u>j) and x 2 (u7;_i) jointly typical, we have Pr(P 4i |n| =1 
^w) — * as n — > 00, by the Markov Lemma [46, p. 436]. 
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Let E 5i be the event that x 1 (w i _ 1 , w-), y2[i], s[i] are jointly typical given u 1 (w i _ 1 ), u 2 (iUi_i, j*) 
x 2 (wj_i) for some w[ ^ Wi, i.e., 

E 5i = {3 ^ 6 {1, . . . , M} s.t. w\ ± Wi , 

(u 1 (w i ^ 1 ),u 2 (w i ^ 1 ,j*),x 1 (w i ^i, w' i ),x 2 (w i „ 1 ),y 2 (i),s) e T^{P UuU ^x x ,x 2 ,y 2 ,s)Y 

Using the union bound and standard arguments on strongly typical sequences, the probability 
of the event E 5i conditioned on E° if E 2i , E^, E^, can be easily bounded as 

Pr(E 5i |E 1 c i , E c 2il E^, E 2i) < M2- n(/(Xi;y2 ' 5|c/l ' l/2 ' X2) ~ e) (A-9a) 

_ 2-n(I(X 1 ;Y 2 \S,U 1 ,U 2 ,X 2 )-R+3e) (A-9b) 

where in ( |A-9b| ) we used the fact that I(X\\ S\U\, U 2 , X 2 ) = under the joint distribution 
©. Thus, Pr(E 5i \ n| =1 E c ki ) — > as n — > 00 if i? < 7(X i; Y 2 \S, U u U 2 , X 2 ). This 
condition, can be rewritten equivalently as 

R<I(X 1 ;Y 2 \S,U 1} U 2l X 2 ) 

= H(Y 2 \S,U h U 2 ,X 2 )- H(Y 2 \S,U X ,U 2 ,X U X % ) 

= H(Y 2 \S,U 1 ,X 2 )- H(Y 2 \S,X U X 2 ) 

= I(X 1 ;Y 2 \S,U U X 2 ) (A-10) 

where the second equality holds since the measure ([5]) satisfies Py 2 \s,u 1 ,u 2 ,x 2 — Py 2 \s,u 1 ,x 2 \ 
and Y 2 and (U\,U 2 ) are conditionally independent given (S, Xi,X 2 ). 
For the decoding of message at the destination, let E 6B be the event that ui(w B ), 
u 2 (w B ,j(s[B + l],w B )), Xi(iub, 1), Y3[B + 1] are not jointly typical, i.e., 

E eB = { (ui (w B ) , u 2 (w B , j (s[B + 1] , w B ) ) , xi (w B , 1) , y 3 [B + 1] ) t T e n (P Vl ,u 2 ,x u v 3 )}- 

For s[B + 1], ui(w B ), u 2 (w B ,j(s[B + l],w B )), Xx(w B ,l) and x 2 (w s ) jointly typical as 



shown by ( |A-8[ ), Pr(E 6B | njj =1 -E^) — ► as n — ► 00, by the Markov Lemma. 
For the decoding of message w B at the destination, let E 7B be the event that Ui(w' B ), 
u 2 (w' B ,j' B ), Xi(w' B , 1), ys[B + 1] are jointly typical for some w' B ^ w B and some j' B E 
{1,..., J}, i.e., 

E 7B = {3 w' B e {1, . . . ,M}Jb e {1, . . . , J} s.t. w' B j+ w B , 

u 1 (w' B ),u 2 (w' B ,j' B ),x 1 (w' B ,l),y 3 [B + 1]J G 2?0?fr 1 ,c&,Xi,y s )}- 
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Conditioned on the events E\ iy E^, E 3i , E% { , E^, E£ B , the probability of the event E 7B 
can be bounded using the union bound, as 

Vr{E 1B \ n| =1 E c ki , Eq B ) < MJ2- nWXl ' Ul ' U2 ' Y3) - e) (A- 11 a) 

= 2-n(I(X 1 ,U 1 ,U 2 ;Y 3 )-I(U 2 ;S\U 1 )-R+e)_ (A- lib) 

Thus Pr(E 7B \ n| =1 E c ki , E C &B ) — > as n — > oo if R < I(X U U x , U 2 ; Y 3 ) - I(U 2 ; S\U X ). 

• For the decoding of message w b _i at the destination, b = B, . . . , 2, let E 8 ( b -i) be the event 
that Ui(w 6 _i), u 2 (w 6 _i, j(s[6],ty 6 _i)), Xi(w 6 _i,w b ), y 3 [6] are not jointly typical, i.e., 

E 8 (b-i) = {(u 1 (w b _i),u 2 (w b _i,j(s[&],W6-i)),x 1 (w b „i,w b ),y 3 [6]j ^(P^^^.yg)}. 

For s[6], Ui(to 6 _i), u 2 (w6_i, j(s[6],ty 6 _i)), xi(io 6 _i,w 6 ) and x 2 (w b _i) jointly typical as 
shown by ( |A-8[ ), Pr(£ l 8 ( b _ 1 )|n| =1 £'^, -E 1 ^, i^ s ) — > as n — ► oo, by the Markov Lemma. 

• For the decoding of message w b _i at the destination, let ©9(&_i) be the event that u 1 («;[ ) _ 1 ), 
^-2(w' b _ 1 ,j' b _ 1 ), yLx{w' h _ l: Wb), y?,\b] are jointly typical for some vJ b _ x ^ w b -i and some 
j' b -i e {l,--,^}> i-e., 

■©9(6-1) = {3 w 6 _i G{1, . . . ,M}, E {1, . . . , J}, s.t. rt^-i 7^ 

(ui K_ ! ) , u 2 ( w' b _ ! , ^_ x ) , X! ( x , w 6 ) , y 3 [6] ) € T™ (P Vl , U2 jXl jY3 ) I . 

Proceeding like for the event E 7B , one can easily show that Pr(E 9 ( 6 _ 1 ) \(~)l =1 E ki , Eq B , E 7B , E^ b _^] 
can be bounded similarly to in ( |A-11| ), and this shows that Pr(£ , 9 (5_ 1 ) \nl =1 E ki , E£ B , E 7Bl E^ b _^) 
— > as n — > 00 if i? < I(X X , U x , U 2 ; Y 3 ) - I(U 2 ; Spt). 
It remains to show that the rate is not altered if one restricts the random variables JJ\ and U 2 
to have their alphabet sizes limited as indicated in (|6]). This is done by invoking the support lemma 
[59, p. 310]. Fix a distribution p of (S, U u U 2 , X u X 2 , Y 2 , Y 3 ) on T(SxlXixlX 2 xXixX 2 xy 2 xy 3 ) 
that has the form ([5]) and satisfies E[<^j(Xj)] < Tj, i = 1, 2. 



To prove the bound ( |6a| ) on |Ui|, note that we have 

I lt (X 1 ;Y 2 \3,U 1 ,X 2 ) = I^X 1 ;Y 2 ,S,X 2 \U 1 ) (A-12a) 
= H (t (X 1 \U 1 )+H ll (Y 2 ,S,X 2 \U 1 )-H tt (X 1 ,X 2 ,Y 2 ,3\U 1 ), (A-12b) 
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where ( |A-12a[ ) follows since (S : X 2 ) <-> U\ <-» X x under the distribution /i. Also, we have 

J M (Xi, C/i, C/ 2 ; Y 3 ) - I^U 2 - S\Ui) = I^Ui, Y 3 ) + I^Xt, U 2 ; Y 3 \U X ) - I„(U 2 ; S\U X ) (A- 13a) 

+ H^Xx, U 2 \Ux) - H^Xx, U 2 , Y 3 \Ux). (A-13b) 
Hence, it suffices to show that the following functionals of fi(S, Ux, U 2 , Xx, X 2 , Y 2 , Y 3 ) 

T' s ,x,x'{l J ') — A*( s ) x , x ') V (s, x, x') G S x Xi x X 2 (A- 14a) 

r x {p)= [ d»{u)[H^Xx\u) + H^Y 2 ,S,X 2 \u) - H^Xx,X 2 ,Y 2 ,S\u)\ (A- 14b) 

r 2 (//) = / d^iH^^l^-H^U^Ysl^-H^U^+H^Slu)], (A- 14c) 
can be preserved with another measure fi' that has the form (|5]). Observing that there is a total 



of |S| |Xi| IX2 1 + 1 functionals in ( |A-14[ ), this is ensured by a standard application of the support 
lemma; and this shows that the cardinality of the alphabet of the auxiliary random variable 
U\ can be limited as indicated in ( |6a| ) without altering the rate ([4]). We note that the inputs 
constraints for the source and the relay, which involve fx(S,Ux,U 2 ,Xx,X 2 ,Y 2 ,Y 3 ) only through 
its marginals over (S, Ui, U 2 , X 2 , Y 2 , Y 3 ) and (S, Ux, U 2 , Xx, Y 2 , Y 3 ) respectively, are satisfied. 

Once the alphabet of U\ is fixed, we apply similar arguments to bound the alphabet of U 2 , 
where this time |S| |Xi| |X2|(|S| |Xi| |X 2 | + 1) — 1 functionals must be satisfied in order to preserve 
the joint distribution of S, Ux, Xx, X 2 , and one more functional to preserve 

I^Xx, Ux, U 2 ; Y 3 ) - I^(U 2 ; S\Ux) = H,{Y 3 ) - H^S) - H^U^) + H^Ux, S\U 2 ) 

+ H^Xx, Ux\U 2 ) - H^Xx, U u Y 3 \U 2 ), (A-15) 



yielding the bound indicated in (6b). 



B. Proof of Corollary [7] 

The proof combines rate-splitting [52] and the techniques used in the proof of Theorem [T] 
As we already mentioned in the discussion following Corollary [T] we split the message W to 
be transmitted from the source node into two independent parts W r and Wd', the relay forwards 
only the part W r , at rate R r , and the part Wd is sent directly to the destination, at rate R d . The 
total rate is R = R r + R d . We transmit in B + 1 blocks, each of length n. During each of the first 
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B blocks, the source sends a message u>j = (w r i, w^i), with w rji G [1, 2 ni?r ] and G [1, 2 nRd ] 
and z = 1, . . . , B denotes the index of the block. For convenience, we let w r ,s+i = Wd,i = 1 • 
For fixed n, the average rate -R^xj over B + 1 blocks approaches R as B — ► +oo. 



Codebook generation: Fix a measure P^t/j^^Xi,^, 12,^3 satisfying ( fT3| ) and E[<^(JQ)] < r i5 
i = 1, 2. Fix e > and let 



J = 2 n(I(Ur,S\U 1 )+2e) (B . 16a) 

M r = 2 n( - Rr ~ 2e) (B-16b) 
M d = 2 n( ^~ 4e) . (B-16c) 



1. We generate M r i.i.d. codewords {111(10^.)} indexed by w' r = 1, .. . ,M r , each with i.i.d. 
components drawn according to Pu x - For each Ui(w' r ), we generate M r i.i.d. codewords 
{u(w' r , w r )} at the source indexed by w r — 1, . . . , M r , and J auxiliary codewords {u 2 (w' r , j)} 
at the relay indexed by j = 1, . . . , J. The codewords u(w' r ,w r ) and u 2 (w' r , j) are with 
i.i.d. components given Ui(w' r ) drawn according to Pup! and Pu 2 \Ui> respectively. 

2. For each Ui(w' r ), for each u(w' r ,w r ), we generate M d i.i.d. codewords {^i(w' rl w r ,Wd)} 
indexed by w d = 1, . . . , M d , each with i.i.d. components given (ui(w' r ), u(w' r , w r )) drawn 
according to P Xl \u u u- 

Encoding: At the beginning of block i, let Wi = (w r> i,Wd,i) be the new message to be sent 
from the source and = (w r! i-i, Wd,i-i) be the message sent in the previous block i — 1. 

At the beginning of block i, the relay has decoded io r ,i-i correctly, and the source transmits 
x.x(w rt i-i,w rt i,Wd > i). The relay searches for the smallest j G {1, • • • , J} such that u 2 (uv,i-i, j) 
and s[i] are jointly typical given ui(iy r) i_i). Since the vectors u 2 (if r i _i, j) and s[i] are generated 
independently given ui(w r i_i) according to the memory less distributions defined by the n- 
product of Pu 2 \Ui an d the n-product of Qs, respectively; and there are J sequences in the bin 
indexed by i« r ,i-i 5 the probability that there is no such sequence u 2 goes to zero as n — ► +00. 
Denote the found j by j* = j(s[i], w r i-i). The relay then transmits a vector x 2 (uv,i-i) with i-i-d. 
components conditionally given (ui(w r i _i), u 2 (w r j_i, j*), s[z]) drawn according to the marginal 
Px 2 \u 1 ,u 2 ,s induced by ( [T3] ). Using arguments similar to those in the proof of Theorem [Tj it can 
be shown that the inputs xi(iy r i_i, w r> i, Wd,i) and x 2 (w r ,i-i) satisfy the input constraints. 
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Decoding: The decoding procedures at the source and the relay are as follows. 

1. At the end of block i, the relay knows uv,«-i and declares that w rji is sent if there is 
a unique w Tji such that u(w r ,i-i,w rt i), y2[i] and s[i] are jointly typical given ui(iu r) i_i), 
u 2 (w r ,i-i, j*) and x 2 (uv,i_i). One can show that the decoding error in this step is small 
for sufficiently large n if 

R r <I{U;Y 2 \S,U 1 ,X 2 ). (B-17) 

2. At the end of the transmission, the destination has collected all the blocks of channel out- 
puts y 3 [l], y 3 [2], . . . , y 3 [£? + 1], and can then perform backward-decoding by first decoding 

(w r ,B,w diB +i) from y 3 [B + 1]. 

First, it declares that the pair (w ri B, w^b+i) is sent if there is a unique pair (w r> s, w d> B+i), 
with w r! B G {1, • • • , M r } and w d ,B+i G {1, . . . , M d }, there is j B G {1, . . . , J}, such that 
Ui(i£v,b)> u 2(4,b, js), u(ix) r ,B, 1), xi(w rijB , y 3 [B + 1] are jointly typical. One 

can show that the decoding error in this step is small for sufficiently large n if 

IU<I(.X 1 ;Yz\U,U x ,U 2 ) 
R d < Ipfx, U 2 ; Y 3 \U, U x ) - I(U 2 ; Spi) 
R r + R d < I(X X , U, U u U 2 ; Y 3 ) - I(U 2 ; Sp 1 ) 

( = } I(X h U h U 2 ; Y 3 ) - I(U 2 ; SpJ, (B-18) 



where in (a) we used the fact that I(U; Y 3 pi, U 2 , X\) = under the distribution ( [T3] ). 
Next, for b ranging from B to 2, the destination knows w r> b and decodes (w r> t,-i,w dt b) based 
on the information received in block b. It declares that the pair (w ri i,-i, w d ^) is sent if there 
is a unique pair (w rt b-i,Wd,b)> with w r ^-i G {1, . . . , M r } and w dy b G {1, . . . , M d }, there is 
jb-i G {1, . . . , J}, such that ui(w r ,b-i), u 2 (w r ,b-hjb-i), u(i£v,6-i, xi(w r)6 -i, w r ,6, w dj6 ) 
y 3 [b] are jointly typical. One can show that the decoding error in this step is small for 
sufficiently large n if ( |B-18| ) is true. 



It remains to show that the rate ( [12] ) is not altered if the sizes of the alphabets of the auxiliary 



random variables U, V\ and U 2 are restricted as in ( jT4[ ) . This can be easily done by following 
the steps in the proof of Theorem [T] 
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C. Proof of Theorem [2] 

Consider a sequence of (e n , n, R, T)— codes with e n — >■ as n 
be less than or equal R np (T). By Fano's inequality, we have 

H{W\Y 3 n ) < nRe n + 1 = n5 n . 

Thus, 

nR = H(W) < I(W; Y 3 n ) + nd n 

(C-20) 

We upper bound I(W;Y 3 ) as in the following lemma, the proof of which follows. 
Lemma 1: 

n 

i) I(W;Y 3 n ) < 53/(X 1 , i> X 2 , i ;y 3 ,i|5 i ) -I{S i ;X 1 , i \Y 3>i ) (C-21a) 

i=i 

n 

ii) I(W-Y^) <J2 I (^ Y 2,i,ys,i\Si,X 2 , i ). (C-21b) 

i=l 

Proo/- To simplify the notation, we use S l = (Si, S 2 , ■ • ■ , Si), Y k l = (Y k)1 , Y k>2 , ■ ■ ■ , Y k)i ), 
k = 2, 3, and X) = (X jtl ,X jt2 , ■■■ , X jti ), j = 1, 2. 
We obtain the bound on I(W;Y 3 ) given in (i) as follows. 

I(W; Y 3 n ) = I(W, S n ; Y 3 n ) - I{S n ; Y 3 n \W) 

n 

= ^I{W,S n ;Y^ j \Y 3 i - 1 ) -H{S n \W) + H{S n \W,Y 3 n ) 
i=i 

n 

= Y^W^M' 1 ) - H(Y z>i \W, S n , Yt 1 ) 
1=1 

-HiSJ+HiSilW^S 1 - 1 )] 

(a) n 

< Y^[H(Y 3 ^-H(Y 3ti \X lii ,X 2ti ,S i ) 
i=i 

-H{S i )+H(S i \W,Y»,S i - 1 ,X 1 J] 

< ^[/(x 1 , i> x 2 ,i,si;y 3 ,i) -/r(5 i ) + ff(s , i |x li i J y 3) i)] 
i=i 



+oo. We show that i? must 



(C-19) 
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-^2,i, S'i! *3,i) — ^3,i)] 

i=l 
n 

y^[-fpfl,z, ^2,i! 5 / 3,i|'S'i) — ^(^i! -X"l,i|^3,i)], 



where 

(a) follows from (W, S 71 , l^ -1 ) <-> (X lii: X 2ii , Si) <-> F 3ji (a Markov chain); and the fact that 
Xi 5 j is a deterministic function of W; and 

(b) follows from the fact that conditioning reduces entropy. 

We obtain the bound on I(W; Y 3 ) given in (ii) as follows. 

I(W;Y 3 n )<I(W;Y 2 n ,Y 3 n ) 
= H(W)-H(W\Y 2 n ,Y 3 n ) 

< H(W\S n ) - H(W\Y 2 n , Y 3 n , S n ) 



7 (^; y 2,i, Y^Yj- 1 , y*- 1 , S n ) 

i=l 
n 

I(W; Y 2>i , Y 3ti \Yt\Yt\ S n , X 2>i ) 



n 

(£) 
i=i 



— ^ ^(Y 2> i ,Y 3ti \ Y 2 l 1 , Y 3 l 1 , S n , X 2>i ) 
i=i 

- H(Y 2 „ Y 3ii \Yt\ Yt\ S n , X 2 „ W)] 

n 

— *y^\H(Y2^Y 3 ^\Y 2 l 1 ,Y 3 l 1 ,S n ,X 2ti ) 
i=i 

- H(Y 2 „ Y 3ii \Yt\ Yt\ S\ X 2 „ W, X lti )] 

n 

<Y}H{Y 2 ^Y^S h X 2 ^ 
i=i 

- H(Y 2>i , Y 3>i \Yt\ Yt\ S\ X 2 „ W, X lti )] 

n 

= y^ffiQ^.i) Y 3ti \Sj, X 2ti ) — H(Y 2t i,Y 3t i\Si,X 2t i,X lti )] 
i=i 

n 

— -^(^i,jj Y 2> i, Y 3;i \Si, X 2;i ), 



i=l 
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where 



(c) follows from the fact that W and S n are independent; and H(W\Y™, Y 3 n ) > H(W\Y 2 n , Y 3 n , S n ); 

(d) follows from the fact that X 2ji is a deterministic function of (S'™,!^ 1 ); 

(e) follows from the fact that X lti is a deterministic function of W; and 
(/) follows from the fact that the channel is discrete memoryless. 

■ 

Consider now the input constraints. By definition the code satisfies 



for w £ {1, • • • ,M}. 

We start with the input constraint of the source. Since each codeword satisfies the input constraint, 
their average over w\ also satisfies the input constraint. Thus, we have 





(C-22) 



i=i 



M 



w=l 
M , n 



W = l 1 = 1 




x™ w=l i=l 




n 



i=l w=l x 




(C-23) 



i=i 
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Similarly, for the input constraint of the relay, we have 

M n 



n 

W=l S n 1=1 



M 

E P(w)P(tr)PW\w)P(x2\8 n , xi)P{y n 2\s n i a 2 

ui=l s n ,x™,x2,y% 



,L ■-) 



n / — ' 

i=l 

^E E ^[^(^2,i)]. (C-24) 



n 
i=i 



We introduce a random variable T which is uniformly distributed over {1, • • • , n}. Set S* = St, 
X\ = Xi t T, X 2 = X 2 ,t, Y 2 = Y%t, and Y 3 = Y 3t T- We substitute T into the above bounds on the 
message rate and the input constraints. Considering the bounds given in Lemma [T| we obtain 

1 - 

— / I{Xii,X2i,Yzi\Si) — I (Si, X 1 i\Y 3 j) 



n 

8=1 



I(X X ,X 2] Y 3 \S,T) - I(S;X x \Yz,T) 

I(X 1 ,X 2 ,S;Y 3 \T) - I(S;X 1 ,Y 3 \T), (C-25) 



and 



1 n 

- V J(X M ; F 2 „ y 3)i |5 f , X 2)< ) = J(X i; y 2s r 3 |^, * 2 , T), (C-26) 
n ' 
1=1 

where the distribution on (T, 5, Xi, X 2 , Y" 2 , F 3 ) from a given code is of the form 

Pt ! s,x 1 ,x 2 X2,Y3 = PsPtPx 1 \tPx 2 \x 1 ,s,tWy 2 ,y 3 \s > Xi,x 2 - (C-27) 

Similarly, substituting T into the input constraints, we obtain 

1 " 

r fc > - ^2^2Px k (xk,i)vk(xk,i) 

n 1 
i=l 

= E[^(X fc )], fc = l,2. (C-28) 



November 24, 2009 



DRAFT 



46 



We now eliminate the variable T from ( |C-25| ) and ( |C-26[ ) as follows. The RHS of ( |C-25| ) can 
be bounded as 

IiX^X^S^T) - I(S;X U Y 3 \T) 

< H(Y 3 ) - H(Y 3 \X 1 ,X 2 , S) - H(S\T) + H(S\X U Y 3 , T) 
= I(X h X 2 , S; Y 3 ) - H(S\T) + H{S\X 1: Y 3 , T) 

< I(X h X 2 , S; Y 3 ) - H(S) + H(S\X h Y 3 ) 
= I(X 1 ,X 2 ,S;Y 3 )-I(S;X 1 ,Y 3 ) 

= I(X U X 2 - Y 3 \S)- I(S; X 1 \Y 3 ), (C-29) 

where 

(g) holds since H(Y 3 \T) < H(Y 3 ) and H(Y 3 \X 1 , X 2 , S, T) = H(Y 3 \X U X 2 , S) (by the Marko- 
vian relation T <->• (Xi, X 2 , S) <->• Y 3 ); and 

(h) holds since S is independent of T and Y 3 , T) < iJ(S|Xi,F 3 ). 
Similarly, the RHS of ( |C-26[ ) can be bounded as 



I(X i; Y 2 , Y 3 \S, X 2 , T) < I(X i; Y 2 , Y 3 \S, X 2 ). 



(C-30) 



Finally, combining flC^ , ( |C-21aD , ( jC^ , ( jC^ at one hand, and ( jC^ , ( |C-21b[ ), ( [C^ , 
( |C-30[ ) at the other hand, we get 



R < I{Xr,X 2 ;Yz\S)-I{S;Xi\Y 3 ) 
R < I{X 1 ;Y 2 ,Y 3 \S,X 2 ), 



(C-31a) 
(C-31b) 



where the distribution on (S, X 1 ,X 2 , Y 2 , Y 3 ), obtained by marginalizing ( |C-27[ ) over the variable 
T, has the form given in < [T8] ) and satisfies E[<y?j(Xj)] < Tj for i — 1, 2. 

We conclude that, for a given sequence of (e n , n, R, T)— codes with e„ going to zero as n goes 



to infinity, there exists a probability distribution of the form ( [18] ) such that the rate R satisfies 
( |C-31[ ) and the input constraints E[^?j(Xj)] < Tj, i — 1,2, are satisfied. This completes the proof 
of Theorem [2l 
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D. Proof of Theorem [3] 

In this proof, we compute the lower bound in Theorem [TJ using an appropriate jointly Gaussian 
distribution on S, X\, U\, U 2 , X 2 . The techniques used in this section rely strongly on those 
used in the proof of Theorem 6 in [16]. 

We first evaluate the second term of the minimization in (|4]) because this gives insights about the 
distribution that we should use to compute the lower bound. The second term of the minimization 
in can be written as 

I{X 1: U U U 2 -Y Z ) - I{U 2 -S\U 1 ) ^ 

I(X X , Ui, Y 3 ) + I(U 2 ; Y 3 \X h U x ) - I(U 2 ; S\X h U x ), (D-32) 

which follows from the fact that I(U 2 ; S\Ui) = I(U 2 ; S\Ui,Xi) for the considered distribution. 

We first focus on the evaluation of the term [I(U 2 ; Y 3 \Xi, U\) — I(U 2 ; S\X%, Ui)}. To evaluate 
it, we assume that X\ is zero mean Gaussian with variance P ls U\ is zero mean Gaussian 



with variance 8P 2 , and X\ and U\ are jointly Gaussian with E[f/iXi] = p' l2 for some 

9 E [0, 1], p' 12 E [—1, 1]. The random variables X\ and U\ are independent of S as shown by 
the distribution given in Theorem [T] We also consider 

X 2 = U x + X 2 (D-33) 

where, X 2 is zero mean Gaussian with variance 9P 2 , is independent of both X\ and U\, and 



is jointly Gaussian with S with E[X 2 S'] = p' 2s y/8P 2 Q, for some p' 2s E [—1, 1]. Then, from ( |23] ) 
and ( |D-33| ), we can write Y 3 as 



Y 3 = X 1 + U 1 +X 2 + S + Z 3 . (D-34) 

Let X 2 = E,[X 2 \S] be the optimal linear estimator of X 2 given S under minimum mean square 
error criterion, and X' 2 be the resulting estimation error. The estimator X 2 and the estimation 
error X' 2 are given by 



** = pLJ^S (D-35) 
X' 2 = X 2 -X 2 . (D-36) 
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We can alternatively write Y 3 in ( |D-34[ ) as 

y 3 = (x 2 -x 2 )+x 2 + x 1 + u 1 + s + z 3 
= x' 2 + x 1 + u 1 + s' + z 3 , 

where 



(D-37) 



We now consider the following new channel output Y 3 given by 

Yi := Y 3 - E[Y 3 \X U U 1 ] = X' 2 + S' + Z 3 . (D-38) 

This new channel output Y 3 ' is similar to the channel output considered in [3] because X' 2 is 
independent of the state S'. Hence, the capacity of this new channel is achieved if we use an 
auxiliary random variable 

U 2 = X' 2 + aS', (D-39) 



where a is Costa's parameter given by 



a = 



nx' 2 2 }+nzi] ep 2 {i-pg)+N 3 



(D-40) 



Then we can easily show that 

[I(U 2 ; Y 3 \X U U 1 ) - I(U 2 ; S\X 1} Ut)] = [I(U 2 ; Y£) - I(U 2 ; S% 



The term [I(U 2 ; Y 3 ) — I(U 2 ; S')] is maximized if U 2 is chosen as in ( |D-39[ ). Thus, we obtain 

I(U 2 ; Y 3 \X U U x ) - I(U 2 ; S\X U U x ) = - log ( 1 + J 



(D-41) 



By substituting X' 2 and 5" in ( |D-39[ ), we get 



U 2 = X 2 - p' 2sX l —S + a[l + p 2 



I9P 2 

~Q 



s 



X 2 + a opt S, 



(D-42) 
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where 



a, 



opt 




0P,(1 



N 3 



(D-43) 



The term I(Xi, U\\ Y 3 ) on the RHS of ( |D-32[ ) can be computed as 

I(Xx, U i; Y 3 ) = h(Y 3 ) - h(Y 3 \X u Ux) 

= h(Y 3 )-h(X 2 + S + Z 3 \X 1 ,U 1 ) 



(6) 



h{Y 3 )-h{X 2 + S + Z 3 ) 

1 / E[(X 1 + X 2 + S) 2 ]+E[Zf] 

2 H E[(X 2 + S) 2 ]+E[Zf] 

1, / n P l + 6P 2 + 2p' l2 ^6P 1 P 2 

- log 1 H — ; 

2 6 V 6Po + O + N-* + 2o'^/9K 



(D-44) 



2 "° V" ' 6P 2 + Q + N 3 + 2p' 2s ^6P 2 ~Q, 
where (b) follows from the fact that X 2 and S are independent of (Xi, U\). Then, by adding 
( |D-41[ ) and ( |D-44| ) we get the second term of the minimization in ([26]). 
The first term of the minimization in Q can be written as 

/(Xi ; Y 2 \S, U h X 2 ) = h(Y 2 \S, U h X 2 ) - h(Y 2 \S,U 1 ,X 1 ,X 2 ) 
= h(X l + Z 2 \S,U h X 2 ) - h(Z 2 ) 



(a) 



h{X x + Z 2 \U X ) - h{Z 2 ) 



(D-45) 



where (a) follows from the fact that X\ and (S,X 2 ) are independent conditionally on U\. 

Finally, we obtain the rate on the RHS of ( f26| ) by maximization over all possible values of 
9 E [0, 1], p' l2 E [—1, 1] and p' 2s E [—1, 1]. Investigating the two terms of the minimization, we 
can easily see that it suffices to consider p' 12 E [0, 1] and p' 2s E [—1,0]. 
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E. Proof of Corollary [5] 

Recall the outline after Corollary |3j We decompose the source input X x and the relay input 

X 2 as 



x x = u + x x 

X 2 = U X + X 2 , 



(E-l) 
(E-2) 



where U and X x are independent zero mean Gaussian random variables with variances 7 Pi and 
7 Pi, respectively, for some 7 E [0,1]; and U x and X 2 are independent zero mean Gaussian 
random variables with variances 9P 2 and 9P 2 , respectively, for some 9 E [0, 1]. Furthermore, X x 
is independent of all other variables; U and U x are correlated, with E[[/[/i] = p' X2 \f9^P x P 2 for 
some p' l2 E [0, 1], and are both independent of S; X 2 is independent of U, is correlated with S 



with E[X 2 S'] = p' 2s \/9P 2 Q for some p' 2s E [—1,0], and is obtained using a GDPC the auxiliary 
random variable of which is given by 



U 2 = X 2 + 



\QPt, 

Q 



P2s< 



l 9t\ 
Q \ 



S 



(E-3) 



for some a' E 
Let 



T :=I{U;Y 2 \S,U X ,X 2 ) 

T x :=T Q + I(X X ;Y 3 \U,U X ,U 2 ) 

T 2 := Tx + I(U 2 ; Y 3 \U, U x ) - I(U 2 ; S\U X ) 

T 3 := I(X X , U x , U 2 ; Y 3 ) - I(U 2 ; S\U X ). 



(E-4) 
(E-5) 
(E-6) 
(E-7) 



Also, define the following function and substitutions which we will use throughout the proof. 

P' 2 :=9P 2 {l-p'l) (E-8) 

Q' ■= (VQ + p'2sVm) 2 (E-9) 
P' 2 Q'{\ - a'f 



P' 2 + a' 2 Q' 



(E-10) 



and let Y 3 := X 2 + S + Z 3 
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i) The computation of the quantities T and T 3 can be done along the lines of those for the 
corresponding quantities in the proof of Theorem [3] We obtain 



T - l \oz(l I ^-P'& \ 
T,--\og[l+ — - ) 



(E-ll) 



and T 3 as given by (|34>. 
ii) Now, we compute T x . 



I(X i; Y Z \U, U lt U 2 ) = I(X i; X 1 + Y 3 \U, U u U 2 ) 

(a) 



I(X 1 ;X 1 + Y 3 \U 2 ) 



(b) 



h^ + Y^-h^U,) 



(c) 1 



log (l + 



E[Xf] 



2 V E[Yi]-E[Y 3 E[Y 3 \U 2 }] 



(J 1 
" 2 



log (l + 



7P1 



(E-12) 



iV3 + $( a '^,p' 2s 

where (a) holds since U and Z/i are independent of Xi, Y 3 and U 2 ; (b) holds since X x is 
independent of Y 3 and U 2 ; (c) holds since X\, U 2 and F 3 are jointly Gaussian, and (d) follows 
by straightforward algebra using the fact that E[y 3 |C/ 2 ] = (3U 2 , with 

P' 2 + a'Q' 



P^ + a' 2 Q r 



(E-13) 



Then, by adding ( |E-1 \) and ( |E-12[ ) we get Ti as given by ( |32| ) 
iii) Finally, we compute T 2 . It can be shown easily that 



I{U*S\U X ) 



1 ^ + c*' 2 Q\ 
2 1 ° g( ^^ ) - 



(E-14) 



Also, we have 



/([/ 2 ; y 3 |t/, f/i) = /(tf 2 ; x 1 + Y 3 \U, u x 

(e) 



IiU^Xt + Ys) 
h(X 1 + Y 3 )-h(X 1 + Y 3 \U 2 ) 

• i , M+E[y 3 2 ] 



2 

(9) 1 



log 
log 



E[Xf}+E[Y 3 2 ]-E[Y 3 E[Y 3 \U 2 }] 

P^ + Q' + 1 P 1 + N 3 
iV 3 + 7 P 1 + $(a',0,p' 2s 



(E-15) 
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where (e) holds since U and U\ are independent of U 2 , Xi and Y 3 ; (/) holds since X % , U 2 
and Y 3 are jointly Gaussian, and X\ is independent of U 2 and Y 3 ; and (g) follows through 



straightforward algebra similar to in ( |E-12| ). 



Adding T\ (given by (32)) and (E-15 ) and subtracting (E-14), we get T 2 as given by (33) 



F. Proof of Theorem |4] 

In this section we use the upper bound for the DM case in Theorem [2] to compute the upper 
bound on the capacity of the state-dependent full-duplex Gaussian RC with informed relay. 



Fix a joint distribution of Xi,X 2 , S, Y 2 , Y 3 of the form ( 18 ) satisfying 



E[X 2 ] =P 1 < P u E[X 2 2 ] = P 2 <P 



2- 



E[XxX 2 ] = ai2, E[X 2 S] = cr 2s , E[XtS] = 0. 
We shall also use the correlation coefficients p i2 and p 2s defined as 

0"12 0"2s 



Pl2 



\fPP, 



2 



P2s 



P 2 Q 



(E-16) 



(E-17) 



We first compute the first term in the minimization on the RHS of ( [T7| ). Let Y = (Xi 

Z 2 ,X l + Z z ) T . We have 

I(X 1 ;Y 2 ,Y 3 \S,X 2 ) = h(Y 2 ,Y 3 \S,X 2 ) - h(Y 2 , Y 3 \S, X u X 2 ) 

= h{X 1 + Z 2 ,X 1 + Z 3 \S,X 2 ) - h(Z 2 , Z 3 ) 



(a) 1 



2 

(6) 1 



log 



E [(Y - E[Y|S, X 2 ] J (Y - E[Y\S, X 2 \ 

- l -\og(N 2 N 3 ) 
E[YY T ] - E[E[Y|5, X 2 ]E[Y|5, X 2 f 



T- 



log ll + Pi 1- , 

2 V 1 - p^ s N 2 N 3 



N 2 N 3 
^— )(— + —)) 



(E-18) 



where, | • | denotes the determinant operator, 

(a) follows from the fact that the conditional differential entropy h(Xi + Z 2 , X\ + Z 3 \S, X 2 ) is 
maximized if (S, Xi, X 2 , Z 2 , Z 3 ) are jointly Gaussian, and 
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(b) follows from the fact the vector (S, X x , X 2 , Z 2 , Z 3 ) is a jointly Gaussian vector and the 
MMSE estimator of Y given (S,X 2 ) is 



E[Y\S,X 2 } = (- 



f 12a2s s + „ auQ X 2 )-(l, if. 

P 2 Q-al P 2 Q~al - 



(E-19) 



4 P2Q - °l 
We now compute the term [I(X U X 2 ; Y 3 \S) - I(Xr, S\Y 3 )\. We have 

I(X 1 ,X 2 ; Y 3 \S) - 7(X i; S\Y 3 ) = h(Y 3 \S) - h(Y 3 \X h X 2 , S) - h(S\Y 3 ) + h(S\X h Y 3 ) 

= h{Y 3 ) - h(S) + hiSlX^ Y 3 ) - h{Z 3 ). (E-20) 



For fixed second moments (E-16), we have 



h(Y 3 ) < - log(27re)(P 1 + P 2 + 2a 12 + 2a 2s + Q + N 3 ) 



(E-21) 



where equality is attained if Y 3 is Gaussian. Similarly, the term h(S\X 1 ,Y 3 ) is maximized if 
{S,Xx,Y 3 ) are jointly Gaussian. Let 5(Xi,F 3 ) = E[S|Xi,Y 3 ] be the MMSE estimator of S 
given (Xx,Y 3 ), i.e., 



S(X 1 ,Y 3 )=E[S\X 1 ,X 2 + S + Z 3 ] 

= ll X 1 + l2 {X 2 + S + Z 3 ) 



(E-22) 



with 



7i 



72 



0"12(Q + °2s 



P 1 (P 2 + 2a 2s + Q + N 3 )-a 



12 



A(Q + ^2, 



Pi(/ J 2 + 2a 2s + g + iV3)-a 1 2 2 



(E-23) 



Then we have 



h(S\X u Y 3 ) = h(S - S(X l , Y 3 )\X U Y 3 



<h{S- ll X 1 - l2 {X 2 + S + Z 3 )) 
1 

~ 2 
1 



log(2vre)E 5 - 7l Xi - 72 (X 2 + S + Z 3 



log (27re; 



QP 1 P 2 + PxiV 3 Q - of,^ - a\ 2 Q 



P 1 {P 2 + 2a 2s + Q + N 3 )-a 



12 



(E-24) 
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where the inequality is attained with equality if S, X 1 , X 2 , Y 3 are jointly Gaussian. From ( |E-20| ), 

I , / (A + P 2 + 2<t 12 + 2a 2s + g + 7V 3 ) 



(E-21 1 and (E-24), we obtain 



I(X 1 ,X 2 ;Y 3 \S)-I(X 1 ;S\Y 3 



log 



{P X P 2 + 2iW + PiQ + PiN 3 - a 



J log + 
2 V P, 1 



127 

(QAh + P.N.Q - ojJ\ - a 2 12 Q ) 
QN 3 

2^ " P 2 12 ~ Pis) + (VQ + P2sVP2) 2 + N 3 - 

(E-25) 



1 , / P 2 (l - p?n - p\ 

+ 2 l0g ( 1+ 



For convenience, let us define the function ©i(Pi, pi 2 , p 2s ) as the RHS of ( |E- 1 8[ ) and the 
function 02 (Pi, -P, P12, P2s) as the RHS of flE-25] ). From the above analysis, the capacity of the 
channel is upper-bounded as 

C < max min{0i(Pi,pi2,p2 S ),02(Pi,P2,Pi2 ) P2s)} (E-26) 
where the maximization is over all covariance matrices ^ Xl ,x 2 ,s,z 2 ,z i of (Xi,X 2 , S, Z 2 , Z 3 ), 

^■Xi,X2,S,Z2,Zs 

/ 



Pi 











P12V /PP2 

l>\:2 V P P P P2,\/P-2Q 





V 









p2 S yP2Q 





Q 





N 2 
N 3 



(E-27) 



that satisfy 



(E-28) 



Pi < Pi, P 2 < P 2 

and have non-negative discriminant, 

QP X P 2 N 2 N 3 {1- p{ 2 - pl s )>Q, 

i.e., for Q > 0, 

p? 2 + pL<1- 

Furthermore, investigating 0i(P, pi 2 , p 2s ) and 2 (P, P 2 , pi 2 , p 2s ), it can be seen that it suffices 
to consider p 12 G [0, 1] and p 2s e [—1,0] for the maximization in ( |E-26[ ). 



(E-29) 



(E-30) 
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To complete the proof, we should show that 0i(Pl, P12, P2s) and 6 2 (Pi, P 2 , pi 2 , P2s) are max- 
imized at Pi = Pi and P 2 = P 2 . It is easy to show that 0i(Pl, pi 2 , P2s) and 6 2 (Pi, P 2 , P12, P2s) 
increase monotonically with p for fixed pi 2 , p 2s , P 2 . Then we can replace p with Pi in both 
0i (Pi, P12, P2s) and © 2 (Pi, P 2 , P12, P2s)- To show that P 2 can be replaced by P 2 , we use the 
following intuitive argument. Since the term ©i(Pi, pi 2 , p 2s ) does not depend on P 2 for given p i2 
and p 2s , it remains to show that P 2 can be replaced with P 2 in only the term 2 (Pi, P2, P12, P2s)- 
The term 2 (Pi, P 2 , P12, P2s) is the sum rate of a two-user MAC with asymmetric CSI in which 
the informed encoder knows the message of the uninformed encoder [16, Theorem 6]. Then, 
considering this MAC, it can be argued [16] that for the sum-rate to be maximized the informed 
encoder should use the entire power available, i.e., P 2 . This concludes the proof of Theorem [4] 

G. Proof of Observation [7] 

We first prove the first statement in Observation [TJ Let us denote N 2 as the RHS of (|42]). We 
have 

(«) . rl , Pi, 



R l ° > min|-log(l + 



N 2 



max - log (l + ^ —=) + - log(l 1 Ps(1 ~ /4i 



i<pL<o 2 V P 2 + Q + A^ 3 + 2p' 2s ^P2~QJ 2 ov ^ 3 

(6) 1 , /, . Pi v 
= 2 1 ° g(1 + iV 2 ) 

:= Pdg, (F-31) 



where (a) follows by putting p' 12 = and 6 = 1 in ([26]), and (6) follows if N 2 > N£. 
Then, it is easy to observe that 

Pdg < Pdg- (F-32) 



From ( |F-31[ ) and ( |F-32[ ), we get that 

Pdg < Pq ^ Cdg < Pdg — Pdg- (F-33) 

Then we can conclude that the lower bound and upper bound meet if N 2 > N£. 

Let us now prove the second statement in Observation [T] If the pair (pi 2 ,p 2s ) that maximizes 
the upper bound in Corollary [4] satisfies the condition in ( f36| ) with equality, i.e., p\ 2 + p\ s = 1, 
then we choose g 2s = P2s, Q12 = P12, and 9 = g% s ( i.e., 9 = q\ 2 ) in the lower bound < [39| ) to 
achieve the upper bound, and thus obtain channel capacity in this case. 
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H. Proofs for Time Division Relaying 

1) Proof of Proposition^ Let (x[% x[% . . . , 1^) and (xj 2 ^, X® mj+2 , . . . , xg) 
be the transmitted sequences from the source during the relay-receive period and the relay- 
transmit period, respectively. The relay receives Y 2> i, Y 2j2 , . . . , Y 2 ^ vn j during the relay-receive 
period and transmits a sequence X 2: [un\+i, X 2i \_ vn \j r2 , • • • , X 2>n during the relay-transmit period. 
From Fano's inequality ( |C-20[ ) and Lemma [T] we have the following 



( n 

nR< min < y] I{X Xti \ Y 2)i , Y 3 ,i\S u X 2 
^ i=i 

n . 

^2l(X lti ,X 2 s,Y 3 j\Si) -J(X 1)i ;5 , i |y 3 ,i) [ +™5„. (G-34) 

We now specialize this bound to the TD mode for which we have X 2)i = for i < \yn\ (as the 
relay does not transmit during the relay-receive period) and Y 2;i = for i > \yn\ + 1 (as the 
relay does not receive during the relay-transmit period). This gives 

j2nx<3;Y 2 „Yff\s?\x 2 , = o)+ nx&YgisjptX^, 

i=l i=[un]+l 
\un\ 



i=i 



n . 

nxl'LXtf, Y^lSh - I(x[f, s!»\Yg) [ + n5 n . (G-35) 

i=[un]+l ' 

By letting n — > oo and using standard arguments [46], we get the single letter upper bound on 
capacity 

C < max min ji/^lf'; Y 2 , Y 3 {1) \S W , X 2 = 0) + vI(X {2) - Y 3 (2) \S {2 \ X 2 ), 

ui(x[ 1) -Y 3 (1) \s^,x 2 = o) - uiix^-s^iY^) 

+ vI{X?\x 2] Y®\SP>) - vl(x[ 2) ; S^Y™)}, (G-36) 
where the maximization is over all joint distributions of the form 

Q sm P x m W Y2X w {x w _ 5(1) Q s( 2) P x( 2) P X2{X (2) iS(2) W Y w {x m >X2 >sW ■ (G-37) 



The bound in (G-36) is the counterpart, to the TD mode, of the upper bound ([17]) for the full 



duplex case. By closely following the arguments and the algebra used in the proof of Theorem |4} 
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it can be shown that this bound is maximized by choosing S^ 2 \ X{ , x{ , X 2 , Y 2 , Y^ x \ K 



'(1) v( 2 ) 



that are jointly Gaussian, with Xj with power P{ is independent of and X} and X 

(2) 

with power P± and P 2 , respectively, are such that 



E[x{ 2) X 2 ] = Pl2 \/P^P2, E[X[ 2) S^] = 0, E[X 2 S®} = p2sVF^. 



,(2) 



.(2) 



Using this distribution, the evaluation of the RHS of ( |G-36[ ) gives the RHS of ( |5Tj ). 

2) Proof of Proposition^ The proof follows by combining the technique of rate-splitting 



[52] and the Generalized DPC described in Section IV-A for the full-duplex mode. Rate splitting 
has the message W to be transmitted from the source node split into two independent parts: Wd 
transmitted directly to the destination at rate Rd, and W r transmitted through the relay at rate 
R r , with a total rate R = R r + R d . 

The encoding and transmission scheme is as follows. During the relay-receive period, the 
source sends a Gaussian signal which carries W r only and is independently drawn with a 



random variable X^ ~ N(0, P^) which is independent of the channel state During the 



relay-transmit period, the source transmits a Gaussian signal x[ 2 - which carries both W r and 



Wd and is independently drawn with X\ ~ N(0, P^')- During the relay-transmit period, the 
relay sends a Gaussian signal X 2) j which carries W r only and is given by 

X 2>i = [/ M + X 2ii , (G-38) 

where Un is drawn with Ui ~ 3\f(0, 9P 2 ) and X 2ji is obtained via a GDPC considering as 
non-causal channel state information during this period. 



(2)> 



The random variables U x and x[ 2) are jointly Gaussian with E[X{ Zj X 2 ] = EX}^] = p' 12 J 6P^> P 2 , 
and are both independent of the state S^. For the GDPC, we use the following auxiliary random 
variable to generate the auxiliary codewords Un, 



(2) 



'(2), 



,(2) 



u 2 = x 2 + 



a'(l + p' 2s 



9P 2 



P2s 



6P 2 



(G-39) 



where X 2 ~ N(0, 9P 2 ) is jointly Gaussian with S®, with E[X 2 S^] = E[X 2 ^ 2) ] = p' 2s ^9P 2 QW; 
and a' is a scale parameter. Thus, using the GDPC given by ( |G-39[ ), X 2yi is generated as 



X 



2,1 



u, 



2,'l 



a'(l + p' 2s 



9P 2 . 



P2s 



6P 2 



S 



(2) 



(G-40) 



where U 2)i is independently drawn with U 2 . 
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(2) / (2) ~ ~ (2) ~ (2) 

Furthermore, we let X{*( = Pi2V p i /OPiUis + Xfi, where Xfi is independently drawn with 
Xf ] ~ N(0, (1 - pi 2 2 )Pi (2) ), is independent of U h X 2 , S®, and carries W d only. 

For the decoding procedures at the source and the relay, we give simple arguments based 
on intuition (the rigorous decoding uses joint typicality arguments). Also, since all the random 
variables are i.i.d., we sometimes omit the time index. The relay subtracts out from the 
received Y 2 and then decodes W r . Message W r can be decoded correctly at the relay as long as 



v 



R r < - log 1 + 



Pi 



(i) 



2 "° V" ' N 2 

The destination jointly decodes W r and Wd from (Kj , Kj , 
done reliable as long as 

R d < v[I{Xf\ U 2 ; YPp) - I(U 2 ; S^U,)] 



Rr + RaKuIiX^iY^) 



v[I{Xf\u x ,U 2 ,Y^)-I{U 2l S^\U l )\. 



(G-41) 

One can show that this can be 



(G-42) 
(G-43) 
(G-44) 



Adding ( |G-41[ ) and ( |G-42| ) at one hand, and < |G-41[ ) and ( ]G-43[ ) at the other hand, and using 
< |G-44] >, we obtain 



J2<^log(l + ^ ' nr/vCO.vO»), 



+ PJ(X 1 w ;y 8 w |I7 1) l7 2 ) 



< ^ log(l + ^-) + i/[I(x{ a) , C/ 2 ; F 3 (2 Vi) - I(U 2 ; S^U,)] 



(G-45) 

(G-46) 
(G-47) 



r < vi{x?- r 3 (1) ) + mx?\ u u u 2 - r 3 (2) ) - i(u 2 , s^lu,)}. 

The computation of the mutual information terms in ( |G-45[ ), ( |G-46[ ) and ( |G-47[ ) involves straight- 
forward algebra which is very similar to that in the proofs of Theorem [3] in Appendix [D] and of 
Corollary [T] in Appendix [EJ and, so, we omit the details for brevity. More specifically, define 

P' 2 := 6P 2 (1 - pg), Q'W := (VQW + p' 2s V6P 2 ) 2 . 



Also, recall <&(a', 0,p 2s ) as defined in ( f55| ). Then, we have the following. 



The mutual information on the RHS of (G-45 ) can be computed as in (E-12) to obtain 



I(x[ 2) -Y 3 {2) \U 1 ,U 2 ) = i-\og[l 



1 



(1 ~ (&) P ( i ] 
N 3 + $(a',e iP2s 



(G-48) 
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The conditional mutual information difference on the RHS of ( |G-46| ) is similar to T 2 in Appendix 
[E] and it gives 



i log f P'2{P'i + Q' {2) + n 3 + (1 - p &)p1 2) ' 



. (G-49) 



The evaluation of the term [I{xf\ U u U 2 ; F 3 (2) ) - I(U 2 ; S (2) |?7i)] is similar to that of ( p52) in 
Appendix |D} and we obtain 



1 / p 1 (2) +^p 2 +2p' 12V /^r^2 x i 

- log H . v + - log 

2 V 9 p 2 + Q (2) + 2f/ 2sV /eP 2 QW + N 3 J 2 



,(2) 



'(2)0 ■ 



PIQ'W{1 - a') 2 + N 3 {P^ + a' 2 Q 

(G-50) 



Also, it is easy to show that 



I(X«;Y«; 



itog(i 



PI 



(i) 



(G-51) 



Finally, we obtain ( |54aD using ( |G-45D and ( |G-48| ); we obtain ( |54b[ ) using ( |G-46| ) and ( |G-49| ); 



and we obtain ( |54c| ) using ( |G-47[ ), ( |G-50[ ) and ( |G-51[ ). This completes the proof. 
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